Journal of 


Experimental Psychology 


a — belong tee Editor 
epartment of Psyc , Universit Michi 
Ann yr Michigan’ of ai 
Davin A. Grant, Associate Editor Detos D. Wickens, Associate Editor 
University of Wisconsin Ohio State University 
Consulting Editors 
E. James Ancuer, University of Wisconsin Liorvp G. Humpuneys, University of Illinois 
Jupson S. Brown, State University of lowe Antuur L. Inion, Tulone University 


Ciztus J. Bunge, Indiana University Howazp H. Kenpizr, New York University 


Paut M. Fitts, Ohio State University 
Cc Donarp B. Linpsuey, 


Kanon ny University of California, Los Angeles 
Pl a —1cre pon aetad jc Kenwetnu MacCorguopare, University of Minnesota 
Jauzs J. Grason, Cornell University Quinn McNemar, Stanford University 
Crassnce H. Geanam, Columbia University Neat E. Mitize, Yale University 
Harzotp W. Haazz, University of Illinois Kennern W. Srencez, State University of lowe 


Editorial Staff: Sapm J. Dovzz, Frances H. Crarx, Barsana Cummincs, Saran Womack 





CONTENTS 


Probabilistic ag 
W. K. Estes, C. J. Bunze, R. C. Arxinson, ann J. P. Franxmann 233 
Statistical Learning Theory Applied to an Instrumental Avoidance Situation: A. L. Bropy 240 
The Length-Difficulty Relationship in Compound Trial-and-Error Learning: C. E. Nosiz 246 
Effect of Pattern Variation Upon Verbal Maze Learning: 

R. F. Tuompson, J. F. Voss, anp W. J. Brocpen 253 
ee re eae en aan H. P. Bauaicx 


Force and Perf in T 
ap. 2 OS Bee a Pre fers, ono “Damen :.... he ee 


siallete ai Mlie "al ao csoe ce eaten stake eed Wcdoeecs tone a 
Associate Learning: G. Manpizr ann E. H. Campseie peas 


Studies of Distributed Practice: XVII. Interlist Interference and the Retention of Paired 
Consonant Syllables: B. J. Usspznwoop anp J. RicHAxpson 


Secondary Extinction of Lever-Pressing Behavior in the Albino Rat: W. W. Rozepoom .. 
Error Reinforcement in a Modified Serial Perceptual-Motor Task: 
M. H. Marx anp R. A. Gowppecx 288 


ee cen Setetel Recprasts Reweittg Foun Enedes Hand Moye- 
ment: G. Doznan 


Problem Solving in Multiple-Goal Situations: S. B. Anperson 





American Psychological Association 
Vol. 54 No. 4 October 1957 





JOURNAL OF EXPERIMENTAL PSYCHOLOGY 


i eS eee © published monthly, 
the American Psychological Association, Inc., 

1g F macnn | ply Kh »Pa. The rate per volume 
is $8.00, or 816.00 annually." Single Single copies are $1.50. orders, 
address changes, and Sena enatideentons should be addressed to the 
American Psychological Association, Inc., 1333 Sixteenth St. N.W., Wash. 
ig, 

This Journ 
intended io emia tae 
mental science. Studies with normal human 
studies involving abnormal or animal subj 
specifically oriented toward the extension i . Experimen 
Sak tata ie in psychology or 


experimen 
neering = spol 
for experimental and 
Normally, cannot be accepted. 


-experiment reporting of 
of the data and 


, an integra 
a die f en ; blicati ~ of soa ne 
ae is or publication tary 
cation Reports in articles of not more than 1.5 printed pages (see Editor's 
Note, this JourNAL, 1957, 53, 1-2). 

Address all articles submitted for publication to the editor, Arthur W. 
Melton, Department of Psy niversity of Michigan, Ann Arbor, 
a op All womaonaie should be submitted in two complete 

rw Be nu Figure may be orginal 2 a at tie aed for 
publication ures may ne rtm. ch or otographic 

nts, but should not exceed 84 by 11 in. og B coreg sa 

ibility of labels and critical features of Pi ad aon betes to a single- 
umn or (exceptionally) a double-column figure in the JOURNAL. 

To save — costs supplementary material to articles in this JOURNAL 
pee er mes with the American Documentation Institute. Copies may be 

ee eee ordering the Document Number given in the footnote reference 
to ADI. should be sent to ADI comt Waltene ss. B Project, 
Photoduplication Service, Library of C ashington 25, Cc. 
Siedate at Go sosetiamnllll Cy thaake or eet oolte oagehle to the 
Chief, Photoduplication Service, Library of Congress. 

Manuscripts submitted to the JOURNAL must adhere to the conven- 

of tables and 


manuscript footnotes, tables (one to a page), 
nr winemaker gh peg 2 tart Age am All typed material must 
be double-spaced with ample margins. The term “subject,” with its various 
forms, is abbreviated S, Ss, S’s, and Ss’; analogous abbreviations are used 
for “experimenter” (£) and “observer” (O). When in doubt about the 
practices of the JOURNAL, authors should examine a recent issue. 


peeks, Te Aco era: Daling’at "te acl root postu provid it la Sars 
a40 


SRL Te i iar bs a 


be made within four 
seen © 1957 by the American Paychological Association, Inc. 








Journal of 


Experimental Psychology 








VoL. 54, No. 4 


Ocrosper, 1957 








PROBABILISTIC DISCRIMINATION LEARNING! 


W. K. ESTES, C. J. BURKE, R. C. ATKINSON,? AND J. P. FRANKMANN 


Indiana University 


Traditionally, discrimination ex- 
periments have been limited to cases 
in which two situations to be discrimi- 
nated differ with respect to some or 
all of their component stimuli or 
stimulus attributes (henceforth re- 
ferred to as cues). Analyses of dis- 
crimination learning in terms of 
statistical learning theory (1, 4) have 
led to the study of a more general case 
which may be termed probabilistic 
discrimination learning. According 
to the theory, there is a basis for 
discrimination learning whenever 
some of the cues in a situation are 
differentially correlated with rein- 
forcing events. Classical discrimi- 
nation learning arises if some of the 
correlations are unity and others zero; 
probabilistic discrimination learning 
arises if the correlations have inter- 
mediate values. Otherwise stated, 
the condition for probabilistic dis- 
crimination learning is that differ- 
ential reinforcement be given relative 
to two situations which include the 
same cues and differ only with respect 
to the sampling probabilities of the 
cues. 


! This research was supported in part by a 
grant to W. K. Estes and C. J. Burke from the 
National Science Foundation. 

* Now at U.C.L.A. 


To illustrate these distinctions, 
consider a standard two-choice dis- 
crimination apparatus (e.g., a Grice 
box or a Lashley stand) with two 
discriminative cues available, say a 
light andatone. ‘To set up a classical 
discrimination problem, one might 
present either the light or the tone on 
each trial and reinforce responses to 
the left door when the tone is present 
and responses to the right door when 
the light is present. To set up a 
probabilistic discrimination problem, 
one might assign to the tone and light 
the probabilities .8 and .4, respec- 
tively, of occurring on trials when the 
left response is reinforced and the 
probabilities .4 and .8, respectively, 
of occurring on trials when the right 
response is reinforced. Now, either 
the tone or the light or both may 
occur on any trial, yet there is a basis 
for better than chance discrimination. 
According to the theory (1, 4), 
learning relative to the individual 
cues should proceed exactly as in 
simple experiments on acquisition 
with random reinforcement. Thus 
the asymptotic probability of a given 
response to any cue taken alone should 
approximate the conditional proba- 
bility of reinforcement in the presence 
of the cue. Using this principle, 
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together with the compounding laws 
of the statistical model (1, 3, 7), it 
should be possible in advance of an 
experiment to predict the asymptotic 
frequency of correct responding and 
the asymptotic probability of any 
given response in the presence of any 
specified combination of cues. For 
the illustrative example given above, 
it would be predicted, employing 
formulae given in a later section, that 
asymptotically the relative frequency 
of correct responses would be .56, the 
probability of a left response to the 
tone alone would be .67, the proba- 
bility of a left response to the tone 
and light together would be .5, and 
so on (these computations assume 
equal frequencies of reinforcement on 
the left and right). 

The experiment to be described 
was conducted to determine whether 
a probabilistic discrimination would 
develop under conditions prescribed 
by the statistical theory and to test 
quantitative predictions concerning 
asymptotic response probabilities. A 
modified verbal conditioning situation 
offered several advantages for these 
purposes. Previous studies (2, 5, 6) 
had established that simple acquisi- 
tion functions for verbal conditioning 
are described by the statistical model, 
and the desired stimulus manipu- 
lations could be introduced without 
changing other aspects of the experi- 
mental situation. 


MetTHOoD 


Apparatus.—The apparatus has been de- 
scribed in detail elsewhere (5). The experi- 
mental room was dark, except for light coming 
from the apparatus, and contained four booths 
and a signal panel. On the panel were mounted 
12 12-v., 25-amp. light bulbs, evenly spaced 
around a circle of 9-in. radius. ‘The booths were 
placed so that all Ss would have a clear view of 
the signal panel at a distance of approximately 
7 ft. Within each booth was a pair of telegraph 
keys, each key directly beneath a reinforcing 
light. ‘The presentations of signal patterns and 


reinforcing stimuli were controlled by an auto- 
matic programming device and the responses of 
Ss were recorded automatically. 

Subjects.—The Ss were 32 non-psychology 
students obtained from language courses in the 
Indiana University summer school of 1954 and 
were paid for their services. They were 
scheduled in subgroups of four for the experi- 
mental sessions. 

Design.—For each S, one of the reinforcing 
lights was designated EZ, and the other £2, the 
side of the E, light being counterbalanced within 
each subgroup of four Ss. During discrimina- 
tion training, either EZ, or E2 appeared following 
the signa! on each trial; the two types of trials 
will be designated 7, and 72, respectively. On 
test trials neither of the reinforcing lights 
appeared. 

The stimulating situations obtaining on 
training trials of types 7; and 7; were arranged 
as follows. First each of the 12 signal lights on 
the panel was assigned a number between | and 
12 by a random procedure. Then for 7; trials 
the lights in order from 1 to 12 were assigned the 
probabilities 5/78, 10/78, . . . , 60/78 of being 
sampled on any one trial; for 72 trials the lights, 
in the same order, had sampling probabilities of 
60/78, 55/78, eves 5/78. In other words, a 
signal light which had been assigned the ordinal 
number i appeared on 7; trials with probability 
5i/78 and on 72 trials with probability 
5(13 — i)/78, as illustrated in the upper panel 
of Fig. 1. 

During discrimination training, 7; and T; 
trials occurred in random sequence with the 
restriction that for one treatment group of 16 
Ss the two types occurred equally often within 
each period and for the other treatment group 
of 16 Ss the proportions were .25 7; to .75 7%. 
A new random assignment of numbers to signal 
lights was made for each subgroup of four Ss in 
the .5 treatment group and its matched subgroup 
in the .25 group. The conditional probability 
of E, in the presence of each individual signal 
light for each group is shown in the lower panel 
of Fig. 1. 

On Days 1 and 2 all groups received 240 
training trials. On Day 3 all groups had first a 
block of 40 additional training trials, bringing 
the total to 520, then four test blocks of 36 trials 
each. Each test block included 18 training 
trials and 18 test trials given in a random order. 
Each set of 18 test trials included one test on 
each of the 12 signal lights singly, one test on 
each of the successive triads (1, 2, 3; 4, 5, 6; 
7, 8, 9; 10, 11, 12), and one test each on the 
mixed triads (1, 3, 11; and 2, 10, 12). 

Procedure.—At the first session Ss were 
seated in the booths and read the instructions 
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Fic. 1. Upper panel: Sampling probabilities 
of stimulus components (individual signal lights) 
on T; and T; trials (i.e., trials on which rein- 
forcing events E; and E; occur). Lower panel: 
Conditional probabilities of E, in the presence of 
individual signal lights for the two treatment 
groups. 


which follow (omitting some repetitious pas- 
sages): 


“IT want to see how well you can do in a 
rather unusual problem situation. At the 
beginning of each trial some lights will appear 
on this panel. About a second later, either the 
left or the right lamp in your compartment will 
light for a moment. As soon as the signal 
pattern flashes on the panel, you are to show 
whether you expect the left or the right lamp in 
your compartment to light on that trial by 
pressing the proper key .... If you are not 
sure, guess—your guesses or hunches may turn 
out to be right, and it is important to have a 
complete record of your learning. 

“Please try not to feel frustrated or irritated if 
you cannot understand what the experiment is all 
about. Trying to figure out what the experi- 
menter is up to is not part of your task, and 
would only tend to spoil your record... . 
The most important thing you can do to ensure 
the success of the experiment is to indicate as 
accurately and candidly as possible on each trial 
which of the compartment lights you expect to 
appear.” 


After four practice trials and review of the 
instructions, the 240 training trials were run off 
without interruption. ‘Time relations within 
training trials were approximately: 2-sec. signal 
duration; 2-sec. delay between signal and rein- 
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forcement; I-sec. duration of reinforcing light; 
4-sec. intertrial interval. 

On Day 2, the instructions were reviewed and 
240 additional training trials were given under 
the same conditions. 

On Day 3, the original instructions were 
reviewed and the following added: 


“Up to now, either the left or the right lamp 
in your compartment has flashed on each trial 
to show you which key was the correct response 
to the pattern shown on the panel. During 
today’s session there will be some trials on which 
neither of the compartment lights will flash. 
These test trials will be included to see if you 
can keep up the same level of accuracy when 
you are not given information as to whether 
you are right or wrong on every trial. Aside 
from this one feature, the situation will not be 
changed in any way, and you are to continue 
making your best guess on every trial.” 


Then the 72 test trials and 112 additional 
training trials were given in the order described 
above. On test trials the signal duration and 
intertrial interval were the same as on training 
trials. 


Resutts ANv Discussion 


Statistical analyses.—The key-pres- 
sing responses corresponding to S’s 
predictions of E, and FE, will be 
designated as A, and Ag, respectively. 
In analyses of discrimination learning, 
the dependent variable will be the 
mean proportion of correct responses 
per 40-trial block for a given group; 
a correct response is defined as an A; 
response on a T, trial or an Ay, re- 
sponse on a T; trial. In analyses of 
asymptotic probabilities of response 
to single lights and to compounds, the 
dependent variable will be the mean 
proportion of A, responses for a given 
group on all test trials of a given type. 
Statistical analyses employed F or ¢ 
tests whenever possible; when vari- 
ances could not be stabilized by an 
arcsin transformation of the scores, 
nonparametric rank tests (8) were 
used. Statistical significance was de- 
fined as the .05 level for a two-tailed 
t test and the equivalent for other 
tests. 
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Fic. 2. Theoretical and empirical curves of 
probabilistic discrimination learning for the two 
treatment groups. (Note:—the last point on 
each curve represents only 32 trials.) 


Empirical curves of discrimination 
learning for the two treatment groups 
are plotted in Fig. 2. Acquisition 
proceeded at about the same rate in 
the two groups and was virtually 
complete by the end of the first 
period ; both curves appear essentially 
horizontal for the last 400 trials. To 
estimate the asymptotic probabilities 
of correct responding, the proportions 
over the last 40 trials of each period 
have been averaged. These mean 
asymptotic proportions are given in 
Table 1; the group difference is 
significant by a rank test, and both 
group means differ significantly from 
5 by ¢ tests. Further, both means 
fall substantially below the values 
that could be achieved by statistically 
optimal “‘strategies.’”* 


*The optimal strategy would require S to 
respond to every signal pattern with the response 
having the higher conditional probability of 
reinforcement (i.e., the higher probability of 
being “correct”). ‘The task of calculating exact 
expected proportions of success for the optimal 
strategy is beyond our computational resources. 
However, by analyzing statistically the actual 
signal-reinforcement sequences which were 
generated for the present Ss, it is estimated that 
these maximal proportions are greater than .90 
for both of the present groups. 


TABLE 1 


Asymptotic Proportions oF 
Correct Responses 





Observed 
Proportion 


Theoretical 
Proportion 


| 67 64 1 


Group t 





25 74 72 74 








Tests of response probability to 
single lights yielded the values plotted 
in Fig. 3; response proportions have 
been subjected to the arcsin trans- 
formation in order to stabilize the 
variances. Itcan be seen that for each 
group, response probability is directly 
related to and, except at the extreme 
values, virtually equal to the condi- 
tional probability of reinforcement. 
An analysis of variance on the trans- 
formed scores yielded significant F’s 
for differences between Lights and 
between Groups. (The mean square 
for Ss-by-Lights interaction was used 
as the error term in the former case, 
and the between-Ss mean square in 
the latter.) 

Response proportions obtained on 
the tests with various triads of lights» 
are summarized in Table 2. The 
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Fic. 3. Predicted and observed probabilities 
of response to individual signal lights for the two 
treatment groups. 
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TABLE 2 


Proportions or 4A; Responses to Test Triaps 


Proportions 


5 Group 


2, 10, 12 
10, 11, 12 





triads have been arranged in order of 
the conditional probabilities of rein- 
forcement, and it can be seen that 
except for one inversion of an adjacent 
pair, the observed values fall in the 
same order for both groups. Group 
differences are significant, by both 
t and F tests, for the first two triads, 
and insignificant for the remainder. 
Within each group, differences be- 
tween values for successive triads two 
or more rows apart are all significant 
by paired rank tests. 


Tests of theoretical predictions.—The 
theoretical functions required for quan- 
titative predictions of the present data 
have been derived in detail in another 
paper (1); here the functions will be 
simply specialized by inserting the par- 
ticular parameter values called for by 
the experimental conditions. 

In this study the sampling probability 
of a stimulus element associated with a 
signal light of ordinal number # is given 
by 

Bin = O(an + df), (1) 
where A equals 1 or 2 according to 
whether the trial in question is of type 
T, or T.; the parameters a, and 4, are 
experimenter-determined, but the value 
of @ must be estimated from data for a 
particular S or group of Ss. Inserting 
the appropriate experimental parameters 
and averaging, it is found that for both 


.25 Group 


Arcsin Transformations 


5 Group .25 Group 


Obs. 


ov 
4 
7 
30 
38 
38 | 


65 





types of trials 
6, = 50/12. 


The theoretical probability of A, re- 
sponses on trials of type T) is given by 


(2) 


1 
Pri(n) = Nb, % 80m), 


where N is the number of cues and F,(») 
is the probability that the ith cue is 
conditioned to response A, after n trials. 
The latter, in turn, is given by 


F,(n) = Fi(2) — (Fi() — F,(0)] 

x (1 — 64 — Obi)". (3) 
Under the present experimental condi- 
tions, the asymptotic value 
expressed as 


can be 


Bi ( 
(1 — B)13 + (28 — 1)i 
where 6 represents the relative frequency 
of T, trials and equals .5 and .25 for our 


two treatment groups. Also, one has, 
for the present conditions, 


ad = (1 — B)5/6 


Fi(2) = 4) 


and 
56 = (28 — 1)5/78. 


Assuming the initial probabilities F,(0) 
equal to .5, and estimating 6 from ob- 
tained total response frequencies as in 
other studies (2, 5), Equation 2 has been 
computed for the two treatment groups. 
The theoretical curves have been plotted 
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together with the empirical values in 
Fig. 2. The values of 6 are .05 and .03 
for the B= .5 and B = .25 groups, 
respectively. The fits appear reasonably 
satisfactory, considering that each curve 
has only one free parameter and that the 
theoretical asymptotes, depending solely 
on experimenter-determined parameters, 
were predictable in advance of the ex- 
periment. Moreover, the estimated val- 
ues of @ are consistent with the values 
obtained by similar estimation pro- 
cedures in studies of simple random 
reinforcement (2, 5, 6) and used to 
predict the data of a previous discrimi- 
nation experiment (4). Whether the 
apparently systematic deviations of em- 
pirical points from the curves on the 
first few trial blocks of Day 1 and on the 
terminal blocks of Days 1 and 2 represent 
a shortcoming of the theory or merely an 
effect of the particular stimulus se- 
quences used cannot be decided on the 
basis of the present data. A numerical 
comparison of predicted and observed 
asymptotic values is summarized in 
Table 1. 

Predicted asymptotic probabilities of 


the A; response to single lights are given 
by Equation 4 and illustrated in the 
lower panel of Fig. 1; it should be noted 
that these values are also the conditional 
probabilities of reinforcing event FE, in 


the presence of the single lights. The 
predicted values are compared with 
corresponding observed values in Fig. 3, 
both having been subjected to an arcsin 
transformation. Throughout the middle 
range, the correspondence of predicted 
and observed values seems about as good 
as could be expected in view of the sizes 
of our experimental groups. At both 
extremes the observed points deviate in 
the direction of .5 from the theoretical 
curves. It may be noted that this is the 
pattern of deviation to be expected if 
there are some background cues in the 
experimental situation which contribute 
to response probability on all trials. By 
appropriate analyses one could estimate 
the weight associated with the back- 
ground cues and allow for it in the theo- 
retical functions, thus improving the fits 
of the curves. However, in this study 


the plan was to deal with theoretical 
curves predicted in advance from the 
experimental parameters alone. 

For the test of response probability to 
selected triads of lights, the predicted 
values are obtained by simply averaging 
the appropriate F; values. Consider, 
for example, the triad of Lights 1, 2, 3. 
Using Equation 4, one obtains 1/13, 
2/13, and 3/13 as the three predicted 
values of F;(*) for the group which had 
B =.5. Averaging these, one obtains 
2/13 = .15, which is the first entry in the 
column of theoretical proportions for the 
.5 group in Table 2. The theoretical 
values thus obtained for all triads tested 
are compared with the corresponding 
observed values in Table 2, each pair of 
values being presented on the left side of 
the table as proportions and on the right 
side as angles (arcsin transforms). Of 
the ¢ tests of differences between theo- 
retical and observed transformed values, 
those for the first two triads in the .25 
group were significant; none of the other 
t’s for the .25 group and none of the ?’s 
for the .5 group had P values less than 
.20. On the whole, the accuracy with 
which observed probabilities of response 
to compounds are predicted in this study 
appears comparable to that achieved 
by Schoeffler (7) for similar tests con- 
ducted following classical discrimination 
learning. 


SUMMARY 


From statistical learning theory it can be 
predicted that a new experimental phenomenon, 
“probabilistic discrimination learning,” will be 
observed if two situations to be discriminated 
include the same cues and differ only with respect 
to the sampling probabilities of the cues. Just 
as partial reinforcement represents a natural 
generalization of simple acquisition and extinc- 
tion procedures, probabilistic discrimination 
learning represents a natural generalization of 
the conventional discrimination paradigm. 

For a test experiment, the two “situations” 
to be discriminated were constructed by as- 
signing two different patterns of sampling proba- 
bilities to a set of signal lights, and a key-pressing 
response was given differential reinforcement 
relative to these “situations.” ‘Two groups of 
human Ss received 592 training trials, under two 
different reinforcement schedules, and 72 test 
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trials designed to evaluate asymptotic response 
probabilities to single cues. 

For both groups, curves of correct response 
proportions approached relatively stable ter- 
minal levels intermediate between chance 
expectations and the maximum values that 
could be attained by optimal “strategies” under 
the conditions of the experiment. Rates of 
probabilistic discrimination learning were very 
similar to those observed previously for learning 
under simple partial reinforcement in the same 
situation. ‘Test trial data confirmed the theo- 
retical prediction that asymptotically the 
probability of response to any single cue would 
approach the conditional probability of rein- 
forcement in the presence of the cue. 
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STATISTICAL LEARNING THEORY APPLIED TO AN 
INSTRUMENTAL AVOIDANCE SITUATION ! 


ARTHUR L. BRODY? 


Indiana Uniersity 


In this study an attempt will be 
made to extend the statistical learning 
theory developed by Estes and others 
(2, 3, 4, 6, 8) to an avoidance situ- 
ation, and to test some hypotheses 
generated by this extension. The 
essential condition of avoidance learn- 
ing is that S’s correct response leads 
to the nonoccurrence of an event. 

In the present situation, human Ss 
are given a continuous tracking task 
with one light providing the only 
illumination for the target. A signal 
tone is presented every 10 sec. at the 
start of each trial in a series of discrete 
trials. Following the onset of the 
signal, S either turns a knob in one 
direction (Response A,), turns the 
same knob in the opposite direction 
(Response A,), or fails to turn the 
knob at all. On a given trial one 
positive response, A, or Ag, is desig- 
nated the correct avoidance response, 
and one response, A; or Ag, is desig- 
nated the correct escape response. 


The correct avoidance response may 
or may not be the same as the correct 


escape response. If S makes the 
correct avoidance response within 1.8 
sec. of the onset of the signal tone, the 
light on the target remains on, per- 
mitting S to continue tracking. In 
other words, darkness is avoided. 
The abbreviation, D, will be used for 
darkness. If S fails to make the 


'This study is part of a dissertation sub- 
mitted to the Department of Psychology of 
Indiana University in partial fulfillment of the 
requirements for the Ph.D. degree in psychology. 
The writer is greatly indebted to Professor W. K. 
Estes, under whose direction the investigation 
was conducted. 

* Now at Lehigh University. 


correct avoidance response within 1.8 
sec., D occurs. On any such non- 
avoidance trial, S turns the light back 
on by making the correct escape 
response. A correction procedure is 
followed on nonavoidance trials, and 
D persists until S makes the correct 
escape response. A trial, then, is 
either terminated by a correct avoid- 
ance response or, when nonavoidance 
occurs, by a correct escape response. 

Both the acquisition of avoidance 
behavior and the final level of learning 
are to be studied. The first variable 
selected which might have an effect 
on learning is the proportion of trials 
on which a given response (a) leads to 
avoidance, and (b) leads to escape. 
In previous studies one response has 
led to avoidance on all trials, and, 
where escape was permitted (5, 9), 
one response led to escape on all 
trials. In this situation a response, 
A, or Ay, may lead to avoidance or 
escape with any specified probability. 
On any trial, an A, is followed by the 
avoidance of D with probability, 7, 
and by the nonavoidance of D with 
probability, 1 — ,. Similarly, an 
A, leads to avoidance with proba- 
bility, m2, and to nonavoidance with 
probability, 1 — mw, An A, is fol- 
lowed by escape from D on a propor- 
tion, ¢, of the nonavoidance trials, 
and since a correction procedure is 
followed, an A, leads to escape on a 
proportion, 1 — ¢, of the nonavoid- 
ance trials. There will, then, be 
partial reinforcement at two points. 
A given response may lead to avoid- 
ance less than 100% of the time and, 
on nonavoidance trials, a given re- 
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sponse may lead to escape less than 
100% of the time. 

A second variable is the method of 
terminating the signal tone. Under 
one condition, called the contingent 
procedure, the signal is terminated 
by S’s correct response. Under the 
second, or noncontingent procedure, 
the signal is terminated by E 1.8 sec. 
after its onset and is not dependent 
upon S’s response. 

Estes’ statistical learning theory 
has been developed from experiments 
in areas other than avoidance learn- 
ing. This avoidance situation, how- 
ever, is similar to the experiments on 
simple human learning (4, 6, 8) in all 
essentials except the relationship be- 
tween S’s response and the nonoccur- 
rence of D. The theory makes the 
assumption that the stimulus situ- 
ation present at the start of each trial 
determines a population of stimulus 
elements. A proportion, 6, of this 
population is sampled on any one 
trial. It follows from the assumption 
of association by contiguity that these 
sampled elements become connected 
to the response which terminates the 
trial. The probability of a particular 
response is defined as the proportion 
of elements in the total population 
that are connected to that response. 
When a response leads to avoidance, 
it is hypothesized that all the sampled 
elements become connected to that 
response. ‘This results in an increase 
in the probability of the response 
which is followed by avoidance. 
When a response leads to nonavoid- 
ance, it is hypothesized that the status 
of sampled elements is unchanged, but 
when this is followed by a response 
which leads to escape, all the sampled 
elements become connected to that 
response. This results in an increase 
in the probability of the response 
which is followed by escape. If the 
hypotheses are correct, the acquisition 
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of avoidance behavior and the asymp- 
totic level of learning will depend upon 
the relative probabilities with which 
A; and A, lead to avoidance and 
escape, 


Mertuop 


Design.—The study consisted of a 2 4 
factorial design. ‘There were two methods of 
terminating the signal (the contingent and 
noncontingent conditions) and four sets of values 
for m1, m2, and @ (referred to hereafter as event 
schedules), Each of the eight groups was run 
under one of the four event schedules with either 
the contingent or noncontingent method of signal 
termination. Six groups, with Event Schedules 
I, Il, and III under contingent and noncon- 
tingent conditions, were run first and are 
designated Exp. 1. ‘The remaining two groups, 
with Event Schedule IV under contingent and 
noncontingent conditions, were run later and are 
designated Exp. 2. 

The probability values used for the four event 
schedules during a first series of 50 trials and a 
second series of 100 trials are shown in Table 1 
All Ss were given the first series of 50 trials and 
were required to meet a criterion of at least one 
avoidance response. Those who failed to do 
so were discontinued after Trial 50. The Ss 
who met the criterion were given the second 
series of 100 trials. For each S three different 
random and independent sequences of events 
were drawn, corresponding to the values for m, 
2, and @ respectively. 


TABLE 1 
VALUES OF m1, 42, AND @ FOR 


Four Event Scueputes 


Event Schedules 
Values 


If Ill 
Series 1 


1.00 
1.00 


1.00 


1,00 | 
1,00 
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Subjects.—The Ss were 182 male and female 
undergraduate volunteers from the beginning 
course in psychology. They were assigned 
randomly to the experimental groups. The 
study was continued until there were 16 Ss who 
met the criterion of at least one avoidance 
response during Series 1 in each of the eight 
experimental groups. 

Apparatus.—The apparatus was a modifi- 
cation of that used by Straughan (8). The S 
sat facing a gray panel, 35 in. wide by 24 in. 
high, which was mounted on a table top. A 
horizontal slot, approximately at eye level, 
enabled S to view the irregular side-to-side 
movements of a black spot. The moving spot 
was actually a segment of an irregular, ¢,-in.- 
wide, black line drawn on white paper and moved 
past the slot by an Esterline-Angus chart drive. 
The black spot was visible to S only when a 7-w. 
bulb, located above the slot and behind the 
panel, was lighted. A vertical pointer in front 
of the black spot could be moved from side to 
side by means of two rods located below the 
slot. Above the slot was a knob, kept centered 
by a small spring and friction device, which 
could be turned 70° to the left or right before it 
closed a switch. For a randomly selected one- 
half of the Ss, a complete turn of the knob to the 
left was an A, response and a complete turn to 
the right was an A; response. The reverse was 
the case for the remaining Ss. An impure tone, 
approximately 1000 cps, served as the signal. 

Procedure.—Instructions to each S included 
the following points: that the pointer was to be 
kept on the moving spot at all times; that the 
knob controlled the light on the target and could 
be turned a considerable distance in both di- 
rections; that since the pointer could not be 
kept on the target if the target could not be seen, 
the light was to be kept on as much of the time 
as possible; and finally, that it was possible to 
keep the light from going off, but if it did go off, 
to turn it back on as quickly as possible. 

Each S was then given some practice in 
turning the light back on as quickly as possible. 
There were four practice escape trials, during 
which the signal tone was not presented and S 
could not avoid having the light go off. All Ss 
had the same sequence of correct escape re- 
sponses for the four trials—turning the knob to 
the left, right, right, and left in that order. 
After completion of the practice trials, S was 
reminded that during the experiment it was 
possible to keep the light from going off. Then 
the experiment began, with a new trial starting 
every 10 sec. with the presentation of the signal. 
The tracking task was continuous, and there was 
no break in going from the first series of trials 
to the second. 
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The dependent variable was the occurrence 
of an avoidance response. The relative fre- 
quency of avoidance responses on each trial 
constituted the measure of response probability. 
For the first series of 50 trials, since r; and m2 
equaled 1.00, either knob-turning response, A, 
or Ag, always led to avoidance if it occurred 
within 1.8 sec. of the onset of the signal. Both 
avoidance responses were therefore considered 
members of the same response class—avoidance 
responses—and changes in the probability of 
this class were analyzed. For the second series 
of 100 trials there was partial reinforcement of 
each avoidance response. One response class, 
A, avoidance responses, was chosen and changes 
in the probability of this class were studied for 
both series of trials. Separate analyses were 
made of the data from Exp. | and the data from 


Exp. 2. 


Resutts AND Discussion 


Avoidance responses: Series 1.—In 
order to obtain 96 Ss who met the 
criterion of at least one avoidance 
response during the first series in 
Exp. 1, it was necessary to run 135 
Ss. To see whether the 39 Ss who 
failed to make an avoidance response 
were randomly distributed among 
treatment groups, two separate chi- 
square tests were run. Significant 
differences were found among event 
schedules (P = .02) and _ between 
methods of signal termination 
(P = Ol). It would seem that a 
greater proportion of Ss learned to 
make the avoidance response (a) 
when the escape response was always 
the same (Event Schedule I), and 
(b) under the contingent condition of 
signal termination. To obtain 32 Ss 
who met the criterion in Exp. 2, 47 
Ss were run. Of the 15 Ss who failed 
to learn, 7 had been run under the 
contingent and 8 under the noncon- 
tingent condition. 

For Ss who met the criterion in 
Exp. 1, a 2 X 3 analysis of variance 
for the total numbers of avoidance 
responses over the 50 trials yielded no 
significant F’s for the two conditions 
of signal termination, the three event 
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Fic. 2. Proportion of avoidance responses per 
five-trial block in Series 1 for Exp. 2. 


schedules, or the interaction of the 
two variables. The totals for the 
two groups in Exp. 2 were within the 
range of totals for the six groups in 
Exp. 1. 

The learning curves for the eight 
groups of Ss who made at least one 
avoidance response were negatively 
accelerated, and ran from an initial 
value of approximately zero to a final 
value of approximately unity. Figure 
1 shows the learning curve for all Ss 
in Exp. 1 who met the criterion, and 
Fig. 2 shows the learning curve for all 
Ss in Exp. 2 who met the criterion. 
These curves were obtained by uti- 
lizing methods outlined by Estes and 
Straughan (4). The 96 Ss of Exp. 1 
were divided into two groups on the 
basis of learning rate (@ value) and 
separate theoretical proportions of 
avoidance responses were calculated 
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for the fast learner group and the 
slow learner group. These propor- 
tions were then averaged, giving one 
theoretical proportion for each of the 
10 blocks of trials. The theoretical 
curve drawn through these values, 
together with the observed propor- 
tions, is shown in Fig. 1. Since the 
values for m; and w, used in Exp. 2 
were the same as those used in Exp. 1, 
there is some basis for assuming that 
the 32 Ss of Exp. 2 should be similar 
to the 96 Ss of Exp. 1 in learning to 
avoid. To check this, the same @ 
values estimated from the two groups 
of 48 Ss in Exp. 1 were used as the 
estimates for fast and slow learner 
groups of Exp. 2. The satisfactory 
fit shown in Fig. 2 was obtained. 

A, avoidance responses: Series 1. 
The observed proportions of A, avoid- 
ance responses in the first series of 
Exp. | for the three event schedules 
are shown in Fig. 3. In all cases, the 
proportions are averages for five-trial 
blocks. To see whether the differ- 
ences associated with the two inde- 
pendent variables were significant, a 
2 X 3 analysis of variance was com- 
puted for total A; avoidance responses 
over the 50 trials in Exp. 1. The 
effect of event schedules was signifi- 
cant at the .O1 level, but the effect of 
the signal termination variable was 
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Fic. 3. Proportion of A, avoidance responses 
per five-trial block in Series 1 for Event Sched- 
ules I, II, and III. 
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not significant. It may be concluded 
that when both A; and A, lead 
uniformly to avoidance, i.e., 7, = m2 
= 1,00, the proportion of A, avoid- 
ance responses is directly related to 
the probability, ¢, that A, leads to 
escape. Additional support for this 
conclusion is obtained from Exp. 2. 
The smallest total number of A, 
avoidance responses during Series 1 
was found for Event Schedule IV— 
the group with the smallest @ value. 

A, avoidance responses: Series 2.— 
The observed proportions of A, avoid- 
ance responses on Series 2 of Exp. 1 
for the three event-schedule groups 
are shown in Fig.*4. Although the 
mean curves in Fig. 4 are essentially 
horizontal, Ss within each group 
starting with high or low probabilities 
showed negatively accelerated learn- 
ing curves. The probabilities asso- 
ciated with Ss starting high came 
down toward the asymptotic value, 
while the probabilities associated with 
Ss starting low rose toward the 
asymptotic value. 

The total number of A, avoidance 
responses on the last 20 trials was 
selected as the estimate of the asymp- 
tote. ‘To determine whether the six 
groups of Exp. | were significantly 
different with respect to these totals, 
a 2 X 3 analysis of variance was run. 
As the variances of these totals were 
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Fic.4. Proportion of A; avoidance responses 
per five-trial block in Series 2 for Event Sched- 
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not homogeneous, the totals were 
transformed, using the arc-sine trans- 
formation (7), before the analysis was 
carried out. The effect of event 
schedules was significant at the .0O1 
level, but the effect of the signal 
termination variable was not signifi- 
cant. It may be concluded that, for 
fixed values of w, and m2, the asymp- 
tote is directly related to the proba- 
bility, @, that A, leads to escape. 


From a contiguity point of view, the 
acquisition of avoidance behavior should 
be facilitated by the contingent pro- 
cedure of signal termination. On non- 
avoidance trials with the contingent 
procedure, a correct response not only 
led to escape from D but also terminated 
the signal. Signal termination, there- 
fore, became associated with the correct 
response. If § attempted to terminate 
the signal as quickly as possible, an 
avoidance response was made, since the 
signal came on 1.8 sec. prior to the occur- 
rence of D. While the signal termina- 
tion variable had an effect on the 
acquisition of avoidance behavior, it had 
no effect on the proportion of A; avoid- 
ance responses made by 5s who learned 
to avoid. Although the signal was 
terminated by the correct response with 
the contingent procedure, signal termi- 
nation occurred regardless of whether an 
A; or an Az was the correct response. 
Signal termination, therefore, had no 
influence on the proportion of A; avoid- 
ance responses. 

Consider now the hypotheses that 
there is an increase in the probability of 
a response (a) when the response leads 
to avoidance and (4) when the response 
leads to escape. Starting with these 
hypotheses, the following expression for 
the mean asymptotic probability value 
of an A; avoidance response [pi(*)] 
can be derived [see (1) for formal 
derivation ]. 

pi(o) = ot) __ 

1 — T+ 7b — Tap 
One can test the hypotheses by inserting 
into the above equation the appropriate 
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TABLE 2 
‘THEORETICAL AND Osservep AsymprToric 
VaLues ror THE Four Event 
ScHEDULES 
Event Schedule 
Measure 


Theoretical . 79 50 


Observed ; 71 53 


values of 41, 2, and @ for the four event- 
schedule groups. This gives the pre- 
dicted asymptotic values which can then 
be compared with the empirically de- 
termined asymptotic values. Table 2 
shows these theoretical asymptotes, to- 
gether with the observed asymptotes 
based on the total number of A; avoid- 
ance responses on the last 20 trials. As 
can be seen, the correspondence is fairly 
good. The hypotheses selected for 
avoidance and nonavoidance trials do 
lead to satisfactory agreement between 
theoretical predictions and observed 
results. 


SUMMARY 


In an avoidance learning situation the 
variable of partial reinforcement was introduced 
and the predictive power of Estes’ statistical 
learning theory was tested. 

Following the onset of a signal, S could make 
a response which resulted in the avoidance of 
darkness on a tracked target. If darkness 
occurred, S escaped by making a response which 
turned the target light back on. Eight groups 
of 16 human Ss each were used. The probability 
that a given response would avoid was varied, 
as was the probability that a given response 
would escape on a nonavoidance trial. ‘Termi- 
nation of the signal was either contingent or not 
contingent upon a correct response by S. 

Acquisition of the avoidance response was 
facilitated by (a) uniform, as compared to par- 
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tial, reinforcement of the escape response on 
nonavoidance trials and (b) the contingent condi 

tion of signal termination. ‘The asymptotic level 
of avoidance learning was directly related to the 
probabilities with which the given response was 
reinforced either by avoidance or, on nonavoid 

ance trials, by escape. ‘The forms of acquisition 
curves and the asymptotic response probabilities 
predicted from the theory were in good agree- 
ment with the data. 
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THE LENGTH-DIFFICULTY RELATIONSHIP IN 
COMPOUND TRIAL-AND-ERROR LEARNING ! 


CLYDE E. NOBLE 


Montana State Unwersity 


Earlier experiments in this series 
(8, 9) have studied the effects of 
varying the number of available 
responses (Ng) in a nonverbal serial 
multiple-choice task (11). At- 
tempting an extension of Hull’s (1) 
theory to human trial-and-error learn- 
ing, the writer (7, 10) applied the 
following rational equation to the 
acquisition curves: 

R, = ali)”, (1) 
where R, = probability of a correct 
first choice (IR +-), a = asymptote of 
R, (1.00), «= initial probability 
(1/Nr), r = rate parameter related 
to Nz, and N = number of trials. 
The average percentage of variance 
in R, accounted for was 99.1%. 

The present investigation is a 
further test of the generality of 
Equation 1. In preceding studies the 
number of choice points (Np) was 
held constant while response availa- 
bility (Ne) varied. In this experi- 
ment Np and Ne were covaried, as in 
serial verbal learning. ‘The hypothe- 
sis was that the parameters i and r 


This report is based on work done under 
ARDC Project No. 7707, Task No. 77133, in 
support of the research and development pro- 
gram of the Air Force Personnel and Training 
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December, 1955. 


should be sensitive to manipulations 
of Np as well as Ng. Related to this 
was the problem of the regression of 
difficulty upon length of task. Since 


the mathematical form of the length- 
difficulty relationship is contingent 
upon scale factors, several types of 
difficulty measure will be analyzed. 


Metuop 


Apparatus.—The learning device, which is 
described in detail elsewhere (10, 11), is a semi- 
automatic 19-key pushbutton apparatus called 
the Selective Mathometer. The time constants 
were the same as those used previously (8, 9). 
Overt errors and omissions were scored as R —’s, 
and late choices were recorded but not scored. 
The stimuli were 14 distinctive electrical circuit 
symbols chosen for their unfamiliarity to Ss. 
These items were prepared as 2 X 2-in. slides to 
constitute a trial sequence. Stimulus presen- 
tation followed an invariant, paced sequence for 
any given S, but the correct responses were 
required in 12 random permutations for each 
experimental group. 

Four treatments were given, in which the 
ratios Ne/Np were 4/4, 6/6, 10/10, and 14/14. 
The same keys were exposed as in corresponding 
conditions of the two experiments on response 
availability (8, 9). Considering S’s keyboard 
as being numbered from left to right, the fol- 
lowing keys were unmasked for each group: 


Ne Available Keys 


4 4,8,12,16 

6 4,6,8,12,14,16 

10 2,4,6,8,9,11,12,14,16,18 

14 1,3,4,5,7,8,9,11,12,13,15,16,17,19 

In each condition S’s problem was to discover 
from among all possible solutions the correct 
sequence of 4 (6, 10, or 14) reactions corre- 
sponding to the 4 (6, 10, or 14) stimuli. As 
before, S was instructed to use the forefinger of 
his preferred hand in making selections and to 
return to the fixation plate after each response. 
Repeated choices were allowed during the stimu- 
lus period (modified-correction procedure), but S 
was informed that responses made during the 
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interstimulus or intertrial periods would not be 
scored. Mastery was defined as five successive 
trials without error, a more stringent criterion 
than was employed in earlier research. The reason 
forincreasing the criterion was thatapproximately 
30% of Ss who attain two successive perfect 
trials subsequently make one or more errors. 
With a criterion of five trials, however, the 
percentage of backsliders is reduced by half. 

Procedure.—A simple randomized design was 
employed in which the interaction of task length 
and permutation effects was confounded with 
individual differences, exactly as in the first two 
experiments. Each of the four Np treatments 
was administered under 12 random response 
permutations (omitting left-right and right-left 
sequences), with four Ss appearing in each cell. 
To minimize intraserial effects, the stimulus 
sequence was changed at random from one 
group of Ss to the next. 

The Ss consisted of 192 basic airmen at 
Lackland Air Force Base, assigned randomly to 
the four experimental treatments within each 
permutation. They were tested in subgroups 
of four, and none had prior experience with the 
Mathometer. For this investigation, four copies 
of the response panel and recording devices were 
constructed. The Ss were arranged in a semi- 
circle but visually isolated from each other by 
plywood walls. The problem stimuli were 
projected on a standard opaque screen to the 
front, with the equipment located to the rear. 
Recorder, counter, and reward circuits were, of 
course, separate for each S. When S attained 
mastery he was unobtrusively dismissed from 
the laboratory by means of a yellow signal lamp 
on his response panel. ‘Two E’s were employed, 
one to operate the control unit and the other to 
monitor the recording devices. Following the 
experiment, each S was isolated from the re- 
maining pool of Ss in order to prevent discussion 
of the procedure. 


Resu._ts AND Discussion 


The acquisition curves are presented 
in Fig. 1, where the probabilities (R,) of 
correct initial choices (1R +-) are plotted 
as functions of trials (N) for the four 
values of Np. Consistent with previous 
work on response availability (8, 9), the 
asymmetrical sigmoid curves are well 
represented by Equation 1. The best- 
fitting equations, obtained by minimizing 
the sums of squares of the residuals to 
determine the parameter r, are as 
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Fic. 1. Probability of initial correct choice 
(R,) as a function of practice (N) with number of 
choice points (Np) as the parameter. The or 
igins are rational. 


follows: 
Np R, = 1.00(.250)™*", [2] 
Np R, = 1.00(.167)", [3] 
Np R, = 1.00(.100)"", [4] 
Np = 14: Ry = 1.00(.071)™*", [5] 


The computational procedure begins by 
taking the second derivative of Equation 
1. This gives 

@R, 


“AN? _ a(i)**r (log.i) (log.r)? 


[1+ r%(log.4)]. [6] 


It can be shown (2, p. 130) that the 
point of inflection (N,) on the practice 
dimension is found by setting Equation 
6 equal to zero. Solving for Ni we have 


N, = — log.(— log.) (7] 


log.r 


from which it follows that 


— log.(— log.) 
N ° [8] 


log.r = 


A preliminary estimate of N, is first 
obtained by inspection of the acquisition 
curve. From the ordinate at 1/e = .368, 
a perpendicular is dropped from the 
empirical R, function to the abscissa. 
This value of N, is then substituted in 
Equation 8, from which an approxima- 
tion to the rate parameter is obtained. 
By successive solutions of Equation 1, 
paired values of observed (R,) and 
expected (R,) response probabilities 
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result. This procedure is repeated for 
selected values of r until the quantity 
~~ (Rk, — R,)* is a minimum. 

Coefficients of determination (R*) for 
Equations 2 through 5 were computed 
directly from the variances of the 
residuals and the RK, values. Having 
the variance of the residuals (o?,) and 
the observed variance (0%), the formula 
is R? = 1 — (0*,,/0%). The percentages 
of determined variance (100R*) are 
97.2%, 98.9%, 99.6%, and 99.7% in the 
order listed. Less distortion results 
from fitting the curves directly to the 
raw data than by rectifying Equation 1 
via the conventional linear reduction 
process (cf. 2), and the present method 
is also more rapid. As predicted, the i 
and r parameters jointly account for 
variations in Np. There is a related 
effect on the inflection points; employing 
Equation 7, zero accelerations occur at 
1.81, 5.79, 12.82, and 17.75 trials for 
Groups 4, 6, 10, and 14, respectively. 
This positive relationship between N, and 
Np can also be seen by inspection of 
Fig. 1. 


There are some irregularities in the 


data. The Trial 1 score for Group 4, 
for instance, shows that these Ss are 
performing below chance initially. The 
empirical R, value for Group 4 on Trial 
1 is .141, whereas the theoretical value 
is .314 reading from Equation 2. By a 
z test for proportions the difference of 
.173 is found to be 2.58 times the a7 of 
.067, which is significant at the 1% point. 

Using a x? test of goodness of fit (two- 
category case), the hypothesis of a ran- 
dom distribution of correct responses at 
the first choice point may be evaluated. 
The logic of this test (3) is that the 
population of initial choices in Group 4 
is distributed according to an asym- 
metrical binomial: R, = .25, Ry = .75. 
The x? for df = 1 is 5.44, which is sig- 
nificant at the 2% point and hence a 
departure from the hypothesis of chance 
responding. When the same test is 
applied to Group 6, the x? is 2.40, which 
is not significant. Inspection of Fig. 1 
indicates that Groups 10 and 14 also 
respond about as would be expected 
from their appropriate binomial hy- 
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Fic. 2. Mean total errors (R —) as a func- 
tion of practice (N) with number of choice points 
(Np) as the parameter. 


potheses. Thus the likelihood of bias 
seems to be greater for shorter tasks than 
for longer tasks. It is an early effect 
which disappears by Trial 3. 

We turn next to an analysis of error 
curves. In Fig. 2 mean errors (R —) 
are shown as functions of trials (N) for 
the four values of Np. Except for 
Group 14 each curve originates at an 
appropriate point on the ordinate; Ss in 
Group 14 make proportionately more 
errors (overts as well as omissions) on 
the initial trials than do the other groups. 
There is also a suggestion of an early 
increase in errors followed by a later 
fall for this group. 

To any analysis of error data in 
sequential learning the question of serial 
position effects is inevitably linked. 
Since this phenomenon is most frequently 
observed during early stages of practice 
Fig. 3 presents the mean errors 
(f —) cumulated during Trials 1-5 for 
each group. There is no evidence of the 
classical function. This horizontal error 
distribution is typical of Mathometer 
data (8, 9); rather than a bowed pri- 
macy-recency effect, there is fairly equal 
difficulty at each choice point in the 
series. To evaluate this behavior more 
fully, independent mixed factorial analy- 
ses of variance (4, Type I) were per- 
formed upon the data in Fig. 3. 

Letting the variable A-classifications 
represent serial positions and the 12 
B-classifications represent permutations, 
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Fic.3. Mean errors (R —) during Trials 1-5 
as a function of serial position with number of 
choice points (Np) as the parameter. 


each cell mean was based upon the pooled 
error scores of four Ss for the first five 
trials. Serial position (A) is a significant 
source of variance for Group 4 (F = 3.28; 
df = 3/108) and Group 14 (F = 2.24; 
af = 13/468); permutation effects (B) 
are significant only for Group 6 
(F = 2.07; df = 11/36); and the A X B 
interactions are significant in all groups 
(F > 2.11; df > 33/108). Primary in- 
terest, however, lies in the A-effects. 
Although there are significant differences 


among choice points for Groups 4 and 14 
in Fig. 3, there is no necessity to test for 


the typical bowed hypothesis. (An ap- 
propriate test might be a comparison of 
the terminal and center pairs of points.) 
The only obvious trend is that for Group 
4, but since neither of the previous 
studies (8, 9) revealed a gradient for the 
4/4 condition, this must be interpreted 
as experimental error. In other words, 
the Mathometer response units comprise 
a chain whose links are approximately 
equal with respect to variations of Nez 
and Np. 
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Finally, the relationships between 
“difficulty,” variously defined, and 
length of task must be considered. 
Table 1 presents five different measures 
of difficulty as functions of Np. The 


scores are defined as follows: 


N = median total trials to mastery, 
computed from the trial number on 
which each S’s mastery run is 
begun; i.e., the first of 5 perfect 
trials. 
mean initial errors; i.e., the average 
number of errors cumulated during 
Trials 1-5 including omissions. 
median total errors to mastery, 
recorded over the same trials as N. 
rate parameter, calculated by em- 
pirical curve-fitting in Equations 1 
and 8. ° 
inflection point, computed from 
Equation 7 as the trial number on 
which acceleration is zero. 


In the case of each type of score in 
Table 1, the index of difficulty increases 
with Np. None of the measures, how- 
ever, is appropriate for determining the 
law governing the amount of work 
required to learn different amounts of 
the same material. As McGeoch (5) 
pointed out, trials-to-criterion scores 
fail to hold temporal duration constant; 
thus, although N is a positive (linear) 
function of Np, the units of measurement 
change systematically from group to 
group. Initial errors are also unsatis- 
factory; the amount of practice is com- 
parable but a uniform performance 
criterion for R — is lacking. The R — 
measure meets the foregoing objections 
because total errors are correlated with 
total time; unfortunately, however, k — 


TABLE 1 


Various Perrormance Measures or Dirricutty as Functions or THE 
Numper or Cuorce Ports (Np) 


Median 
Total 
Trials (N) 


Mean 
Initial 
Errors (R 


Median 
Total 
Errors (R —) 


Inflection 
Point 
(No) 














1.81 
5.79 
12.82 
17.75 


20.0 
67.0 
234.0 
473.5 
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is difficult to rationalize since it is so 
strongly affected by individual differ- 
ences. Rate parameters provide a more 
rational index of difficulty, but r is 
subject to the same criticism as N with 
respect to the problem of equal units. 
The AN, concept, despite its abstract 
properties and the possibility of coordi- 
nating it with Equation 1, must also be 
rejected on similar grounds. The most 
suitable index, derivable from the trials- 
to-criterion measure, is the total time 
required to reach mastery. Here the 
units are equal, the criterion is uniform, 
and the dimension has a meaningful 
origin. 

Perhaps the most precise hypothesis 
about length-difficulty phenomena is one 
which Thurstone (13) derived from his 
rational equation (12) for the verbal 
learning function: 

T = c/k(Np) V Ne — 5, [9] 
where 7 = total time to criterion, c 
= E’scriterial constant, k = S’s learning 
constant, Np = number of choice points, 
and s = S§’s span of attention. 

By making some simplifying assump- 
tions, it can be shown that Equation 9 
as applied to the Mathometer is essen- 
tially a parabola. First of all, the 
concept of attention span (s) is not 
relevant to trial-and-error learning, so 
we may sets = 0. Second, there should 
be a meaningful T value for a single-link 
problem, and it would be desirable to 
have the curve originate at zero. Since 
only by chance can S make a perfect 
score in any condition on Trial 1, we 
may let the time required for a single 
predetermined choice be equal to the 
stimulus duration plus the interstimulus 
and intertrial periods. The constant 
ratio c/k, therefore, may be replaced by 
the (log intercept) parameter a specifying 
the unit time of work and rest. Finally, 
since Np V Np = N>'-*, we may assume 
more generally that the equation we seek 
has a slope parameter 4 whose value is 
greater than unity, thus producing 
positive acceleration as in Thurstone’s 
(13) experimental examples. 

From the foregoing, it follows that a 
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simple logical adaptation of Equation 9 
to the present situation would be 


T = a(Np)’, 


[10] 


where 7’ = median total time to mastery 
in seconds, calculated for each Np con- 
dition as the median trials to mastery 
(N) multiplied by the trial duration, i.e., 
N(N p(2.00 + 1.44) + 6.88]; @ = trial 
duration in seconds for Np = 1, calcu- 
lated as the sum of the stimulus, inter- 
stimulus, and intertrial periods, i.e., 
10.32 sec.; and 4 = slope parameter 
> 1.0, inversely related to ability and to 
distribution of practice. 

By causing the curve in Fig. 4 to pass 
through the rational points (0,0) and 
(1, 10.32), then minimizing the sums of 
squares of the remaining four residuals 
to determine 4, the best-fitting parabolic 
equation is 


T = 10.32(Np)*™. 11] 


The per cent of variance in difficulty 
ascribable to Equation 11 is 99.8%. 
The major hypothesis of the present 
experiment may be considered verified; 
i.c., that the parameters i and r in 
Equation 1 both respond to joint vari- 
ations of task length and 
availability. Furthermore, it appears 
that the basic form of probability 
acquisition curves in human trial-and- 


response 
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error learning is invariant over changes 
in the number of choice points. What 
is denoted here by “basic form” is the 
general mathematical equation (exclu- 
sive of the specific numerical values of 
its parameters) which most closely 
approximates the empirical data; i.e., 
Equation 1. This hypothesis will be 
recognized as a logical consequence of 
the Kjerstad-Robinson law (5, p. 48). 

There remains the problem of the 
regression of “difficulty” upon length of 
task. The present analysis, in agree- 
ment with McGeoch (5) and Thurstone 
(13), indicates that total time to mastery 
is the most appropriate index of diffi- 
culty, and that a rational equation for 
this function has the parabolic form 
Y = aX’, where 4>1. Assuming, 
therefore, that the length-time relation- 
ship for compound trial-and-error learn- 
ing is of the type described by Equation 
10, it is interesting to compare the results 
of a similar investigation of perceptual- 
motor learning by the writer (6). The 
apparatus was the Complex Coordination 
Test. The variable Np was manipu- 
lated in terms of the number of successive 
visual patterns to be matched, and 
difficulty was expressed as total time per 
match. In that experiment 99.9% of 
the variance was determined by a /inear 
equation. Although the differences be- 
tween trial-and-error learning and _ per- 
ceptual-motor skill are many, both 
experiments involved the acquisition of 
an invariant sequence of paired asso- 
ciates. That is, each response in the 
series was connected to a unique stimu- 
lus, and the reinforcing event was 
independent of the preceding response 
and of the succeeding stimulus.? 

As was previously discussed (6), 
positive acceleration is a characteristic 
of “competitional” learning problems. 
whereas “‘noncompetitional” situations 
give linear relationships. In the former 
case, time per unit is contingent upon 
task length, but in the latter it remains 


2In mazes and serial verbal learning these 
factors are confounded, with the result that 
anticipatory and perseverative errors occur most 
frequently in the middle of the chain. 
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constant. Thus, despite its freedom 
from classical serial position effects, the 
Selective Mathometer is a sequential 
learning device of the “‘competitional” 
or habit interference type. Apparently, 
however, the successive acts in a com- 
pound chain may compete with each 
other for order of appearance without 
producing differential probabilities of 
error at the several choice points. The 
practical importance of this is that one 
may investigate the dependence of 


difficulty upon complexity in the absence 
of intraserial complications. 


SUMMARY 


This experiment involved compound trial- 
and-error learning under four conditions of task 
length (Np). The Ss (N = 192) practiced 
series of either 4, 6, 10, or 14 pushbutton re- 
sponses to a criterion of five perfect trials. 

Response probability curves for the four Np 
conditions were skewed and S-shaped. Fach 
curve originated at a hypothetical R» value 
consistent with the initial chance solubility (1) 
of the problem. The inflection points (N4) 
were a direct function of Np, and the acquisition 
rates (r) varied inversely. The rational equa- 
tion R, = a(i)r* accounted for an average of 
98.8% of the variance. 

With respect to the regression of difficulty 
upon length of task, six different types of score 
were analyzed. The index 
was the median total time to mastery in seconds 
(1). The rational equation T = a(Np)* ac- 
counted for 99.8% of the variance. Consistent 
with earlier research, errors were practically 
independent of serial position but there were 
significant permutation by position interactions 


most appropriate 


The findings were discussed in relation to 
Thurstone’s length-difficulty function and the 
K jerstad-Robinson law. 
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EFFECT OF PATTERN VARIATION UPON 
VERBAL MAZE LEARNING! 


R. F. THOMPSON, JAMES F. VOSS,? AND W. J. BROGDEN 


University of Wisconsin 


In previous studies on the effect of 
number of alternate choices per unit 
of a verbal maze on acquisition and 
serial position errors, note was made 
of an apparent effect of the pattern 
of the series of correct responses (1, 
2). Even though the maze pattern 
was generated by use of a table of 
random numbers, acquisition and 
frequency of serial position errors 
appeared to be altered differentially 
by certain restricted sequences within 
the total pattern. Since attempts to 
provide a quantitative scale for pat- 
tern characteristics by means of 
mathematical models have not to 
date been successful, empirical ap- 
proaches to systematic variation of 
pattern characteristics were con- 
sidered. One such approach involves 
systematic variation of the location 
of a doublet (two successive like 
responses) within the maze pattern. 
The present paper reports the results 
of an experiment designed to investi- 
gate the hypothesis that systematic 
variation in the location of a doublet 
within the maze pattern will affect 
acquisition and serial position errors. 


Metuop 


The 16-unit maze with four alternate choices, 
each occurring four times in the pattern, used by 
Brogden and Schmidt (1) was altered in the 
following manner. The identical pattern was 
used for the control group except that the 
alternative choices were the numbers 10, 20, 30, 
and 40. For the five experimental groups, the 

1This research was supported in part by a 
grant from the Graduate Research Committee 
from funds provided by the Wisconsin Alumni 
Research Foundation. 

2 Wisconsin State College, Eau Claire. 


basic maze pattern was changed to provide a 
doublet for Maze Units 2 and 3, 5 and 6, 8 and 9, 
1] and 12, and 14nd 15. The pattern for each 
one of the experimental groups and that for the 
control group was permuted by first adding 10 
to each number of the group pattern, second by 
adding 20, and third by adding 30. The num- 
bers 50, 60, and 70 were reduced to 10, 20, and 
30, respectively. The four subpatterns thus 
obtained provide for the representation of each 
number once in each of the 16 positions. The 
doublet in each experimental group pattern is 
represented by each of the following pairs: 10-10, 
20-20, 30-30, and 40-40. 

There were 24 Ss in each group and six Ss of 
each group learned one of the four subpatterns 
provided by the permutations of the numbers. 
Half of the Ss in each condition were handled by 
each of two Es, so that E was orthogonal to the 
other experimental conditions. The Ss were 
members of the classes in elementary psychology 
at the University of Wisconsin who volunteered 
to serve in an experiment on learning a verbal 
maze. The Ss were assigned to the various 
conditions by a random procedure. 

The following instructions were read by E to 
each S. “In this experiment you will be re- 
quired to learn a single list of numbers. These 
are the numbers used—10, 20, 30, and 40. Each 
number is used more than once in the list. 
When‘I tell you to begin, start naming these 
four numbers. When you name the correct 
number for the first position of the list, I will say 
‘forward.’ Then start naming the numbers 
again. When you name the second correct 
number of the list, I will say ‘forward.’ This 
procedure will be continued until the end of the 
list is reached. Please respond as fast as you 
can, When the first trial is completed, there 
will be a short rest before the second trial is 
started. We will continue with similar trials 
until you are able to make one repetition of the 
list without error. Are there any questions?” 

After answering any questions, E said 
“Begin” and S responded by giving one of the 
four alternate responses. The procedure de- 
scribed in the instructions was followed until S 
attained the criterion of one errorless trial. The 
E recorded every response for each unit on each 
trial. A stopwatch was used to measure the 
time of each trial and the one-minute rest 
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interval between trials. Other conditions of the 
experiment were comparable to those used in 
previous studies (1, 2). 


REsuLts 


The number of trials, the total 
errors, and the total time to the 
criterion trial were tabulated for each 
S. An analysis of variance was per- 
formed on the four permutations of 
the maze pattern for the control group 
and for each of the five experimental 
groups for the three measures of 
acquisition. Since no significant dif- 
ferences between permutations were 
found for any of the groups, the data 
from the permutations were pooled 
for further analysis of the acquisition 
data. An analysis of variance for 
each of the three sets of acquisition 
data was performed and in no case 
was maze pattern (doublet position) 
a significant source of variation, nor 
were there any other significant 
differences. 


Mean group error over all trials to 
the criterial trial was computed for 
each group to obtain the data for 


plotting the filled-circle solid-line 
serial position error curves presented 
in Fig. 1. Examination of these 
curves indicates that those for the 
experimental groups are of different 
form from each other and from that 
of the control group. When com- 
parison is made between each curve 
of the experimental groups and the 
curve for the control group at the 
locus of the doublet in the curve of the 
experimental group (represented by 
the two vertical arrows with unbroken 
connection) the frequency of error at 
the position of the second item of the 
doublet is greater than is the fre- 
quency of error at the position of the 
first item of the doublet. This phe- 
nomenon is more directly shown by 
the open-circle broken-line curves 
plotted for each of the experimental 
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groups. These represent the differ- 
ence between the serial position error 
curves of the appropriate experi- 
mental group and the control group. 
Since irregularities of similar nature 
to those apparently produced by the 
doublet were noted in each of the 
curves for the experimental groups, 
an examination was made of the maze 
patterns. For nearly all cases of 
substantial deviation of the nature 
apparently produced by a doublet, it 
was found that a given response item 
was followed by an identical response 
item one position removed, e.g., 20— 
10—20 for the Positions 13, 14, and 
15 of the maze pattern for Group El. 
This sequence form we have called a 
split-doublet. There are no split- 
doublets in the pattern of the control 
group. Their occurrence in the pat- 
terns of the experimental groups is a 
function of the method of altering the 
pattern of the control group in order 
to provide for the location of a doublet 
at different serial positions in the 
experimental group patterns. 

The differences between the five 
experimental group curves repre- 
senting the difference of each S in the 
experimental groups from the control 
group curve were tested by an analysis 
of variance, the results of which are 
summarized in Table 1. Serial Posi- 
tion, the interaction of Serial Position 
and Maze Pattern, and the triple 
interaction of Serial Position, Maze 
Pattern, and Maze Pattern Subgroups 
are the only significant sources of 
variation. 

The significance of the Serial Posi- 
tion source of variation indicates that 
the points of the over-all curve differ 
as a function of serial position. The 
interaction of Serial Position and 
Maze Pattern indicates that the form 
of the Serial Position curves varies as 
a function of maze patterns or groups. 
The differences between curve form 
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include those produced by the locus of 
both the doublet and split-doublet 
sequences, since these two variables 
are confounded. The significant 
triple interaction indicates that the 


serial position curves for the different 
maze patterns differ as a function of 
the maze pattern subgroups. 

In order to obtain tests of the effects 
apparently produced by the doublet 
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TABLE 1 


Anatysis oF VARIANCE oF SERIAL 
Postrion Errors 





Source 





Between Ss 
Maze Pattern (M) 
Experimenter (£) 
Maze Pattern Sub- 


Residual Between Ss 
Within Ss 

Serial Position (SP) 

S M 


Residual Within Ss 














* Significant at .0S level. 


and split-doublet sequences, a ¢ test 
of the difference between means 
(difference between experimental and 
control groups) of the first and second 
doublet position over all the experi- 
mental groups and a ¢ test of the 
difference between means (difference 
between experimental and _ control 
groups) of the first and third split- 
doublet positions over all the experi- 
mental groups were performed. Both 
tests resulted in values of P smaller 
than .05. 

The increase in errors at the second 
item of the doublet and the third item 
of the split-doublet may be similar to 
the negative recency effect reported 
by Jarvik (3) for probability learning. 
The present four-choice alternate 
response and maze length represent a 
marked difference from Jarvik’s two- 
response guessing situation and series 
length. An analysis of the first trial 
of the present Ss provides a test of 
whether their guessing behavior con- 
tributes to the effect of the doublet 


sequences. The frequency with 
which each of the four alternative 
responses followed each of the correct 
responses in the maze pattern was 
tabulated on the first trial for every 
S of each of the six groups and these 
data are presented in Table 2. Ex- 
amination of the table shows that for 
each group the frequency of response 
of the previously correct response is 
less than any other kind of response 
except in the case of a response of 10 
(diagonal from upper left to lower 
right cell for each group). A chi- 
square test of the over-all frequency 
distribution of Table 2 was performed 
in order to test the hypothesis that the 
distribution of responses differs from 
chance. This test is significant at 
better than the 5% level. Although 
this test is not direct, its results are 
consistent with the hypothesis that 
during the discovery phase of learning 
the four-choice maze pattern, Ss tend 
not to repeat the response that was 
correct in the immediately preceding 
position. The one exception to this 
tendency is the frequency of the first 
response being 10 even when the 
previously correct response was 10. 
This may be due to the ordinal series 
of the alternate responses and its 
presentation to S in the instructions 
in the order 10, 20, 30, 40. The 
frequency of the 10 response may 
account for the significant triple 
interaction of Serial Position, Maze 
Pattern, and Maze Pattern Sub- 
groups, since 10, 20, 30, and 40, were 
equally represented in the doublets 
and split-doublets. An examination 
of the responses on Trial 2 gave similar 
results, except that there was a de- 
crease in the frequency of 10 as the 
first response following any alter- 
native. This change may be at- 
tributed to effects from the first trial. 

The first-order contingent responses 
to the third element of all three 
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TABLE 2 


Frequency or Response on THE Finst Tria ror Eacu or tHe Four ALTeRnative 
Responses Fottowinc Eacu Correct Response or tHe Maze Patrern 


Prior 
Correct 


First Order Contingent Response 





Response 10 20 | 30 | 


wo | 10 | 





Group C 
46 




















element sequences were tabulated for 
the first trial for each S of all groups 
in terms of the first element. This 
frequency distribution was tested for 
the split doublet effect by chi square. 
Although the test was significant at 
the 5% level, the distribution differs 
from that obtained previously for the 
two-element sequences used in testing 
the doublet effect. There is a high 
frequency of responses of 10 similar to 
that noted in the data of Table 2, but 
there is no decrease in frequency of 
response along the diagonals as was 
found in the data of Table 2. Thus, 
there is no indication that the Ss tend 
not to repeat a response that was 
correct for the maze unit two positions 
backward. Since the presence of the 
split-doublets in the maze patterns 
was not planned, no other tests of the 
split-doublet effect were attempted. 


Discussion 


The discovery of the effect of the split- 
doublet sequence upon serial position 
errors has revealed the experimental 
design of the present study to be faulty 
in terms of its intentions. Instead of 
providing for an analysis of the effect of 
maze pattern upon acquisition and serial 
position error curves as a function of the 
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locus of doublets, the independent vari- 
able of maze pattern represents a con- 
founding of the presence and position of 
doublets and split-doublets. The failure 
to find any significant effect of maze 
pattern upon trials, time, or errors as 
acquisition measures may be due to 
interactions between the doublet and 
split-doublet variables. The significant 
effect of maze pattern upon errors to the 
criterial trial for the experimental groups 
minus the errors to the criterial trial for 
the control group suggests that doublet 
position, split-doublet position, or both 
may have an effect upon measures of 
acquisition. 

The significant effect of doublet and 
split-doublet position upon the form of 
the serial position error curve is clearly 
established. Although the over-all form 
of the serial position error curve is a 
function of the presence and loci of 
doublet and split-doublet sequences, the 
effect in both cases is one of an increase 
in error. In the case of the doublet 
sequence, the second item of the doublet 
produces a frequency of error signifi- 
cantly greater than does the first item. 
In the case of the split-doublet sequence, 
the third item of the sequence produces 
a significantly greater frequency of 
error than does the first item. A more 
precise determination of these effects 
will result, of course, from experiments 
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in which doublet and _ split-doublet 
position are manipulated separately as 
independent variables. 

The analysis of the first order con- 
tingent responses indicates that the 
guessing habits of Ss and the instructions 
given S about the sequences of guessing 
may be critical factors in the acquisition 
of alternate choice maze patterns, par- 
ticularly during the discovery phase. 
The interaction of these variables with 
variables of maze pattern complicate 
investigations of the effects of the latter 
variable on acquisition and serial position 
errors. It may be hypothesized, how- 
ever, that the interaction of these two 
kinds of variables may result in facili- 
tation as well as in a decrement of 
acquisition of verbal mazes. 


SUMMARY 


This experiment was designed to test the 
effect of locus of a doublet (two successive like 
responses) on the acquisition and serial position 
errors of a 4-choice, 16-unit verbal maze. The 
maze pattern for the control group had no 
doublets. Those for the five experimental 
groups had a single doublet at maze units 2 and 
3, 5 and 6, 8 and 9, 11 and 12, and 14 and 15, 
respectively. Analysis of variance of the acqui- 
sition data in terms of trials, time, and errors to 
the criterial trial resulted in no significant 
differences. Examination of serial position 
errors in terms of the difference between the 
control group and each of the experimental 
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groups revealed an apparent effect of an increase 
in error for the second item of the doublet. A 
similar effect was discovered for split-doublets 
(two identical response items with a different 
response item between them) in that frequency 
of error was greater for the second one of the two 
identical responses. The split-doublet  se- 
quences occurred due to alteration of the pattern 
for the control group to provide for the doublets 
in the maze patterns for the experimental 
groups. Analysis of variance of the serial 
position error data showed significant differences 
in the form of the curves. Further analysis by 
t tests revealed the doublet and split-doublet 
effects to be significant. Analysis of first order 
contingent responses suggests that Ss tend not to 
repeat a response that was previously correct. 
This kind of guessing behavior may account for 
the doublet effect. 


REFERENCES 


1. Brocpen, W. J., & Scumipt, R. E. Effect 
of number of choices per unit of a verbal 
maze on learning and serial position 
errors. J. exp. Psychol., 1954, 47, 235- 
240. 

2. Brocpen, W. J., & Scumipt, R. E. Acqui- 
sition of a 24unit verbal maze as a 
function of number of alternate choices 
per unit. J. exp. Psychol., 1954, 48, 
335-338. 

3. Janvix, M. E. Probability learning and a 
negative recency effect on the serial 
anticipation of alternate symbols. /. 
exp. Psychol., 1951, 41, 291-297. 


(Received August 27, 1956) 





Journal of + Psychology 
Vol. 54, No. 957 


INCIDENTAL LEARNING 


AT FIVE STAGES OF 


INTENTIONAL LEARNING 
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This study investigates the acqui- 
sition of incidental learning at pro- 
. gressive stages of intentional learning. 
Specifically it is hypothesized that 
incidental and intentional learning 
are complementary, so that incidental 
learning occurs primarily at those 
stages of intentional learning when 
motivation is relatively low, or when 
S is uncertain of the task set by E. 

Several recent studies (2, 4,5) have 
indicated that incidental learning is 
subject to many of the same effects 
observed with intentional learning. 
The progress of incidental learning in 
relation to the simultaneous progress 
of intentional learning has, however, 
not been investigated. In one earlier 


study (1) it was shown that Ss given 
a high incentive for the task set by E 
will acquire less incidental learning 
than Ss provided with a lower in- 


centive. These findings were related 
to cognition type learning theory by 
the following rationale: The S is 
usually under the simultaneous in- 
fluence of several goal demands, and 
only one of these may be directed 
toward the incentive provided by E£. 
If the several goal demands are nearly 
equal in strength, one may expect 
frequent changes of set. If one of the 
goal demands greatly exceeds the 
others in strength, less variability of 
S’s set is to be expected. Further, 
it may be assumed that “intentional 
learning,” or learning relevant to the 
task defined by FE, occurs whenever 
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the corresponding set is present, and 
incidental learning whenever S _ re- 
sponds with another set determined 
by another goal demand. When the 
incentive offered by E is increased, 
one would expect a stronger goal 
demand in relation to it, therefore a 
more persistent set and less incidental 
learning. With a smaller incentive 
a weaker goal demand would be ex- 
pected, therefore greater variability 
of set and more incidental learning. 

This same rationale would lead to 
the prediction that the rate of progress 
of incidental learning is quite different 
from, and perhaps somewhat com- 
plementary to, the progress of inten- 
tional learning. More specifically, it 
has generally been assumed that the 
strength of an incentive increases as 
it is approached, and that Ss show 
more persistent efforts directed to- 
ward mastery of a task as they near 
the completion of the task. This 
would suggest that relatively little 
incidental learning occurs during the 
later portions of the course of inten- 
tional learning, and relatively more 
during the early portions. 

The present experiment was de- 
signed to explore empirically the 
relations between the course of in- 
tentional and the course of incidenal 
learning. 


Metuop 


Subjects.—One hundred Ss satisfying a course 
requirement for elementary psychology were 
assigned alternately to five groups of 20 Ss each 

Apparatus.—The apparatus used was a 
memory drum which exposed serial lists at a 
3-sec. rate; a practice list consisting of seven 
meaningful adjectives, and an experimental list 
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consisting of 16 nonsense syllables. Each of the 
syllables was enclosed in a different geometric 
form, such as a circle, square, rectangle, etc. 
The geometric forms were drawn with India ink 
on a white background, and each form occupied 
an area of approximately } sq. in. 

Procedure.—All Ss were first trained on the 
practice list, with instructions to anticipate each 
word before it appeared in the drum window, 
beginning with the second trial. All Ss were 
trained on the practice list to the criterion of one 
perfect trial. 

After a rest period of 3 min., all Ss were in- 
structed to learn the experimental list in the 
same manner. Instructions were added to spell 
out each syllable, and no mention was made of 
the forms in which the syllables were enclosed. 

Group I learned to the criterion of four correct 
responses (25% of the list); Group II learned to 
a criterion of eight correct responses (50%); 
Group III to a criterion of 12 correct responses 
(75%); Group IV to a criterion of 16 correct 
responses (100%). The Ss of Group V each 
learned to a criterion of 16 correct responses plus 
half the number of trials required to reach that 
criterion (150%). 

Immediately after training was terminated, 
Ss of all groups were shown a 16-row by 16- 
column chart. Each row of the chart showed a 
particular syllable of the list enclosed in all of 
the possible 16 geometric forms used in the list. 
The syllables were arranged along the columns 
in the same sequence as during learning, and the 
sequence of forms used within each of the rows 
was randomized. The Ss were instructed to 
point out the geometric form associated with 
each syllable. 


RESULTS 


Figure 1 shows incidental learning 
at five stages of intentional learning. 
Since there were 16 possible forms we 
may assume that chance would ac- 
count for a total of one correct form 
identification for all of the 16 syllables. 
On this basis it is apparent that all 
groups showed significant incidental 
learning. It further appears that 
the greatest increments of incidental 
learning occurred during early (Group 
I) and late (Group V) stages of prac- 
tice, with little difference among the 
intermediate stages of practice. 


To test the over-all significance of 
differences among the groups, the inci- 


HARRY P. BAHRICK 











Fic. 1. Incidental learning at five stages of 


—intentional learning. 


dental learning data were subjected to 
an analysis of variance. The results 
indicate that the over-all group differ- 
ences are significant (F = 3.68, P < .01 
for df 4 and 95). Individual ¢ tests 
indicate that Group V (150%) differs 
significantly (P < .05) from all of the 
other groups, but the remaining groups 
do not differ significantly from one 
another. 

It is of interest to determine the 
relation between the degree of inten- 
tional learning of a particular syllable 
and the incidental learning of the 
geometric form in which that syllable 
was enclosed. To determine this rela- 
tion the correct number of responses 
given to each syllable was summed for . 
Ss of each group for all trials. Similarly, 
the total number of correct geometric 
form responses of each group was deter- 
mined for each syllable. The  cor- 
relations between these summed scores 
for intentional and for incidental learning 
associated with particular syllables were 
determined for each group. None of 
these correlations was significant at the 
.O5 level; they were .28, — .03, — .15, 
.07, and .30, respectively, for the five 
groups, suggesting considerable inde- 
pendence of the degree of intentional and 
incidental learning associated with par- 
ticular syllables. 

Post-experimental report revealed that 
a total of seven Ss attempted to memo- 
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rize the forms despite the fact that the 
forms were not mentioned in the in- 
structions. These Ss and their inci- 
dental learning scores are distributed as 
follows: Group I, two Ss with scores of 
6, 8; Group II, one S§ with a score of 10; 
Group III, three Ss with scores of 4, 7, 
and 5; Group IV, none; Group V, one § 
with a score of 12. 


Discussion 


The results suggest different types of 
acquisition functions for intentional and 
incidental learning connected with a 
particular task. In accordance with 
prediction it was found that little or no 
incidental learning occurred between the 
50% and the 100% level of intentional 
learning. At this stage of intentional 
learning there is reason to expect that the 
most persistent efforts are directed 


toward mastery of the task set by E. 
The large amount of incidental learning 
during the very early trials has been 
attributed to greater variability of set 
due to a more balanced strength among 
several co-existing goal demands. 


An- 
other reason for the greater variability 
of set during the early trials might be 
the fact that cognitions relating set to 

.the attainment of the goal are less 
firmly established at this time. 

It appears reasonable that the efforts 
directed toward intentional learning 
diminish during the trials devoted to 
overlearning. This should be particu- 
larly true after § has correctly responded 
to all the syllables for several consecutive 
trials and perceives the task as essentially 
accomplished. According to the ra- 
tionale developed earlier a less persistent 
set related to intentional learning would 
be expected at this stage, and conse- 
quently more incidental learning. The 
results substantiate the notion that the 
acquisition functions for intentional 
learning and incidental learning are 
different, and to some extent comple- 
mentary. 
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The interpretations developed here are 
similar to those developed by Bruner, 
Matter, and Papanek (3) in a recent 
paper dealing with the dimension of 
breadth of learning. 


SUMMARY 


The hypothesis was tested that incidental 
learning occurs primarily during those stages of 
intentional learning when motivation is rela- 
tively low. At these stages a greater variability 
of set is assumed as a result of a condition of 
near balance among several competing goal 
demands. 

To test this hypothesis 100 Ss were assigned 
to five groups and were trained in a serial 
memory task. The training of the various 
groups was interrupted after they had reached 
25%, 50%, 75%, 100%, and 150% of criterion, 
respectively. All Ss were tested for incidental 
learning immediately after their training was 
interrupted. It was shown that most incidental 
learning occurs during the very early trials, and 
again during the trials devoted to overlearning. 
It was concluded that incidental and intentional 
learning are to some extent complementary, 
with incidental learning occurring primarily 
when S is either uncertain of, or less motivated 
in relation to, the task set by £. 
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When S exerts a force on a control 
device, such as the control column of 
an aircraft, proprioceptive feedback 
information is a function of the 
particular spring stiffness, damping, 
and mass characteristics of that 
control (1, 2, 7). Of these three 
control loading variables, spring stiff- 
ness has received major attention, 
with interest directed particularly 


toward the relative effectiveness of 
force vs. amplitude cues in the use 
of a spring-centered control. 

Weiss presents two sets of data 
(10, 11) which suggest that variation 
in displacement of a control (ampli- 


tude cues) can significantly affect 
positioning accuracy while changes in 
the spring constants (force cues) did 
not appear to so affect performance. 
These results were not consistent with 
previous data by Gibbs and Baker 
(5) which show a superiority of a 
pressure stick (maximum force, mini- 
mum amplitude cues) over a free 
control (maximum amplitude, mini- 
mum force cues) in a continuous 
tracking task, nor were the Weiss 
data in agreement with Orlansky’s 
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deductions (9) which indicate spring 
centering and nonlinear torque 
changes as most desirable features for 
control devices. 

A report by Bahrick, Bennett, and 
Fitts (1) suggests a possible reason 
for this discrepancy : There appears to 
be a significant interaction between 
force and amplitude cues in a simple 
positioning task when that task in- 
volves both knowledge of results and 
extended practice. Thus, both force 
and amplitude cues can contribute to 
the skillful execution of simple posi- 
tioning responses, and the effective- 
ness of either set of cues is limited by 
the level of the other set. 

The present research attempted to 
determine the relative effectiveness of 
force and amplitude cues on perform- 
ance and on learning in a complex 
two-dimensional tracking task. This 
is the second of a series of studies to 
assess training task effectiveness in 
terms of performance on a transfer 
task which simulates the tracking 
situation present in a modern inter- 
ceptor aircraft. 


Metuop 


Apparatus.—The OSU Pilot Training Re- 
search Simulator (6) served as the skill task. 
This apparatus includes an analog computer 
that simulates certain characteristics of an 
interceptor aircraft with a radar fire-control 
system. ‘The S views a cathode ray tube (CRT) 
compensatory tracking display and attempts to 
remove an error set in by E at the start of each 
run, by manipulating a spring-centered control 
column. Lateral deflections of the control 
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generate a voltage, X, which is subjected to two 
stages of integration, the first representing rate 
of roll of the aircraft, the second simulating rate 
of change in heading. Fore and aft deflections 
generate another voltage, Y, which is subjected 
to an exponential oscillatory function plus one 
stage of integration, the combination repre- 
senting the underdamped response of an aircraft 
in the pitch dimension. 

The initial system input is a step function 
and one may consider the target “course” as 
being of zero relative velocity. The S’s goal is 
to remove the initial error by returning the 
target dot to the center of the CRT, the appro 
priate area being shown as a circle of }-in. 
diameter on the display. Movements of the 
target on the CRT result from two effects: (a) 
S’s control movements, which are subjected to 
the transformations cited above, and (b) a 
constantly increasing magnification of the error, 
which is achieved by multiplying the Y and Y 
error components by 1/7, where T represents 
time-to-go in a trial and is continuously variable 
from 40, at the beginning, to 1 at the end of each 
trial. More detailed descriptions of the ap- 
paratus appear elsewhere (3, 6). 

Subjects—The Ss were 68 freshmen and 
sophomore male students. All were volunteers 
from the Air Command Squadron of the Air 
Force ROTC unit at The Ohio State University, 
none of whom had received prior flight training. 
Each S was paid for five experimental sessions 
of approximately 45 min. each. 

Procedure.—During the first session E gave 
S a demonstration of the task requirements, 
following which each S received 10 40-sec. trials 
of tracking with 20-sec. rest between trials and 
5-min. rest between blocks of five trials. The 
Ss served in pairs; one S tracked for five trials 
while his companion observed, then the second 
S tracked for five trials, and soon. During the 
following four sessions each S received a total 
of 20 40-sec. trials per session. A 24-hr. interval 
separated each of the five sessions. ‘There were 
a total of 60 training trials and 30 transfer trials. 
Transfer was made halfway through Session 4. 

The Ss were assigned at random to one control 
and three experimental groups of 17 Sseach. A 
systematic variation of the force and amplitude 
cues present in the control column defined 
groups. A summary of these conditions during 
the 60 training trials for lateral control deflec- 
tions is given in Table 1. The vertical or pitch 
dimension of the tracking task was modified in 
an identical manner. 
control 


Group 1 served as the 
and these Ss trained with a 
control column involving force and amplitude 
cues approximating those found in an interceptor 
aircraft at 500 kts. and at 35,000 ft. Columns 
4 and 5 present physical specifications of the 


group 
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TABLE 


Force-DiseLacement Conpirions FOR 
Rott Maneuvers 


| | | Lateral 
Centering 
Amplie | porce 
| tude | “Ub ment 
} Force 
| | Ib. /deg.) 


| 
Group | Force 


tivity® 


10 

40 
40 
| ' 


* These are arbitrary values for computer sensitivity 
(Se) or “aerodynamic sensitivity’ for roll (deg./sec.* 
per degree of control displacement) Exact specifi 
cations appear elsewhere ( 


| 
| 10 


force cues while Column 6 indicates the relative 
level of “aerodynamic” or control sensitivity, 
S., of the tracking system. As shown in 
Columns 2 and 3, the “normal” control cues are 
designated as “1.” normal cues were 
those present in the control column used in a 
previous study (3), 


These 


Group 2 received their training trials with a 
control in which the force cues (relative torque 
change) were attenuated by a factor of 4, com 
pared to the “normal” stick. This 
complished by reducing the spring constant, 
as shown in Table 1. Computer sensitivity to 
a given amount of control column movement was 
unchanged, leaving the amplitude cues at the 
same level as for Group 1. Therefore, Ss in 
Group 2 were required to displace the control to 
the same extent as Group | in order to achieve 
the same system response, but at those ampli 
tudes Group 2 experienced only one-fourth the 
force feedback in terms of relative torque change 


was ac 


For Group 3 the spring constant was in 
creased to four times that for Group 1 while 
computer sensitivity also was increased to four 
times “normal.” ‘These Ss now deflected the 
control only one-fourth the amplitude required 
of Group 1 for the same system response, but at 
that amplitude Group 3 experienced the same 
total force feedback as that received by the 
control group at four times the amplitude 
Finally, Group 4 tracked with a control column 
in which both force and amplitude cues were 
reduced as compared to Group 1. For Group 
4 the spring constant present in the control 
column for Group | was used, but the computer 
sensitivity was increased by a factor of four over 
There 
fore, these Ss ac hieved the same system response 
by deflecting the control one-fourth the ampli 
tude needed by Group 1, but at that amplitude 


that experienced by the control group 
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Training and transfer performance levels for all groups over the five experimental days. 


Transfer was affected during Session 4. 


Group 4 experienced only one-fourth the force 
change received by Group 1. 

During the 30 transfer trials, Groups 2, 3, 
and 4 tracked with the same control column force 
and amplitude cues as the control group, Group 
1. Thus, the experiment provides two sets of 
related data: (a) during the training trials it is 
possible to assess the influence of changes in 
force and amplitude cues on tracking perform- 
ance, and (b) during the transfer trials it is 
possible to determine the influence of force and 
amplitude cues on the learning of this skill task, 
independent of the specific effects of these cues 
on performance during training. 

Tracking proficiency was measured in terms 
of time on target simultaneously in Y and Y. 
The scoring area was a square } in. x } in. at 
the center of the CRT; this approximates the 
target circle which was } in. in diameter, ‘This 
score was cumulated over the final 3 sec. of each 
tracking trial. A previous study (3) indicated 
that particular period of scoring as satisfactory 
for this skill task. 


RESULTS 


The training and transfer data for 
all groups over the five sessions of the 


Maximum possible score was 3 sec. 


experiment are presented graphically 
in Fig. 1. Each point is the mean 
performance of 17 Ss as determined 
by the time-on-target scores averaged 
for five-trial blocks. There were two 
blocks of trials on Session 1 and four 
blocks on each of the remaining four 
sessions. ‘Transfer was made midway 
through Session 4. 

Training performance.—During the 
60 training trials, Group 1 shows a 
consistent superiority over the three 
experimental groups, with Group 2 
enjoying a slight superiority over 
Groups 3 and 4. These latter two 
groups exhibit no consistent superi- 
ority one to another. It appears, 
then, that reduction in force and/or 
amplitude cues can influence tracking 
performance. 

In order to assess the relative im- 
portance of force vs. amplitude cues 
on performance, a simple 2 XK 2 





FORCE AND AMPLITUDE CUES IN TRACKING 


TABLE 2 


ANALYSIS OF VARIANCE FoR TRAINING BLocks 
11 Pius 12 


Source | @& MS 


509.23 
2639.59 
972.86 
204.66 


Force (F) 


l 
Amplitude (A) l 
FXA l 
Residual oF 


*P < 05. 


analysis of variance was performed 
on the data for the final 10 training 
trials. For this analysis the data for 
Blocks 11 and 12 were averaged for 
each S in each group. The results of 
this analysis are presented in Table 
2. The lack of statistical significance 
(P > .05) for the Force variable is in 
agreement with previous analyses by 
Weiss (10, 11), and considered alone, 
this would suggest that only Ampli- 
tude was the important (P < .0O1) 
variable in this tracking task. How- 
ever, the statistical significance of the 
Force X Amplitude interaction (P 
< .05) qualifies these conclusions. 
This interaction implies that both 
force and amplitude cues influenced 
tracking proficiency here, and Table 
3 shows this quite clearly. 

In Table 3 the time-on-target 
scores were averaged for groups and 
then arranged according to the levels 
of the force and amplitude variables 
used in this experiment. The Duncan 
Multiple Comparisons Test (4) was 
applied to and the 


these means, 


TABLE 3 


Averace Time-on-Tarcetr Scors 
Traininc Biocks 11 anp 12 
(Maximum possible score = 3.0 sec.) 


OVER 


Relative Force 
Relative 
Amplitude 
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results show that Group | perform- 
ance (2.36 sec. on target) differed at 
the .O1 level of confidence from the 
performance levels of Groups 2, 3, 
and 4, the latter three groups ex- 
hibiting no significant differences one 
from another. 


changes in 
affected 


that 
significantly 


Thus, it 
amplitude 
performance but only force cues 
were high (had a value of “1"’) not when 
force cues were reduced (had a value of 
“}""), and likewise changes in force cues 
significantly affected performance when 
amplitude cues were high, but not when 
these latter cues low level. 
This significant interaction confirms the 
expectation based on the data of Bahrick, 
Bennett and Fitts (1): force and ampli 
tude interact to influence the levels of 
performance in skill tasks which involve 
controls with spring loading. 

It is to be noted, however, that the 
particular pattern of differences found 
here to define the Force X Amplituds 
interaction differed from that pattern 
noted by Bahrick, Bennett, and Fitts. 
In that earlier study, force cues did not 


appears 
cues 
when 


were at a 


appear to differentiate performance at 
high levels of the 
(70° movements of a 
control) but did have a significant effect 
on lower levels (35° and 17.5° displace 
ments). It appear that a par 
ticular interaction pattern is dependent 
primarily upon the slope of the function 


amplitude variable 


unidimensional 


would 


relating spring stiffness to control dis 
placement and the portion of that fun 
tion present in a 
Therefore, the statistical significance of 
the interaction, per se, is the important 
finding at this time. 
ditional research is needed to define the 
this 


given experiment 


Considerable ad 


nature of interaction 


pletely. 


more com 


Transfer performance.By visual 
inspection of Fig. | it can be 


that a high degree of positive transfer 


seen 


occurred for each of the three experi- 
mental groups. The 
this observation was substantiated by 


significance of 
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TABLE 4 
Anatysis oy Variance ror Traininc Biocks 
11 Pius 12 ano Transrer Biocks 
l Pius 2 
Source U MS 


2319.02 
623.72 


Groups (G) 


Ss/G 


Training-Transfer 
(T-T) 

TT XG 

T-T X S8/G 


11 492.52 


113.11" 
6.96" 


Blocks (B) 


111.90 
24.54 
54.28 


an analysis of variance for the data of 
Training Blocks 11 and 12 and for 
Transfer Blocks | and 2. The results 
of this analysis are presented in Table 
4. The Training-Transfer compari- 
son shows a highly significant effect. 
A more analytic expression of this 
transfer is justified by the significance 
of the ‘Training-Transfer X Groups 
interaction, i.e., application of the 
Duncan ‘Test to the eight mean scores 
defining this interaction gives the 
results listed in Table 5. Those means 
shown within brackets do not differ 
significantly, one from another, while 
those means not within a bracket do 


TABLE 5 


\verace ‘Time-on-Tarcetr ror TRAINING 
Biockxs Il Pius 12 ann Transrer 
Biocxs | Pius 2 
(Maximum possible score = 3.0 sec.) 


Group | Transfer 


|: oe 
| 2.36 
2.21 


231 


Training 
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differ at the .05 level. As shown 
earlier in Table 3, Group | is signifi- 
cantly superior to the three experi- 
mental groups at the end of the 
training sessions. During the trans- 
fer trials, however, this difference is 


diminished markedly. It is con- 


cluded, then, that near 100% transfer 
was found for each of the three ex- 
perimental groups. 

A convenient index to express this 
high degree of positive transfer is 
dat x 100, where C, is 
1 


Cr — "ie the 
initial performance level of Group | 
(.6 sec.), Ep is the transfer perform- 
ance level of an experimental group 
(2.36, 2.21, and 2.31 sec. for Groups 
2, 3, and 4, respectively), and Cy is 
the performance of the control group, 
Group 1, at the time of transfer (2.54 
sec.). These indices are 90%, 83%, 
and 88% for Groups 2, 3, and 4, 
respectively. 


Discussion 


The data presented in Fig. 1 indicate 
that whereas the particular combination 
of force and amplitude cues present in a 
control can set rather definite 
tracking performance level, 
there apparently is little retardation in 
the learning of the skill task as compared 
to learning under either more or 
“favorable” control characteristics. 

Transfer effects—This latter finding is 
in accord with an hypothesis stated in 
an earlier report by the authors (3). 
There it was proposed that practice with 
any particular range of response ampli 
tudes should involve near 100°, transfer 
to a task permitting a different range of 
response amplitudes provided (a) no 
important system transformations were 
changed from the training to transfer 
tracking systems, and (4) the responses 
practiced during training were not quali- 
tatively different from those required 
during transfer. 

The first restriction was met in the 
present study as the system transforma- 


column 
limits on 


less 
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tions for both elevation and azimuth 
corrections were constant from training 
to transfer tasks (only the values of the 
constants for the mathematical 


ations were changed). 


oper- 
The second re 
striction was met in that Groups 3 and 
4 trained on a system requiring control 
displacements of only lesser amplitudes 
than those of Group 1—the training and 
transfer responses were quantitatively 
not qualitatively different. 
utilized a control permitting substan 
tially the same range of displacement 
amplitudes as for Group 1. 

Training effects.—Ilt is clear from Fig. 
1 that 4-to-1 changes in amplitude cues 
produced a greater effect on tracking 
performance (Group 3 vs. Group 1) than 
a comparable ratio of change in force 
cues (Group 2 vs. Group 1). This is in 
agreement with previous data (1, 10, 11). 
It is the purpose of this section to show 
that the particular order of proficiency 
levels, found here for the four groups 


Group 2 


during training, is a logical order and 
one consistent with previous information 
on force discrimination. 

Bahrick, Bennett, and Fitts (1) pro 
data from their 
study and that of Jenkins (8), that the 


pose, on the basis of 


ratio yrovides an index predictive 


AF 

F(AD) ' 
of accuracy in 
centered control. 


positioning a 
Here, F is the force 
required to achieve a specified control 
displacement, and AF is the force change 
associated with some given displacement, 
AD. The empirical data suggest that as 
this ratio js constant 
displacement, positioning accuracy In 
creases. 

An interpretation of this ratio is 
facilitated by considering the force 
displacement plot shown in Fig. 2 for the 


spring 


increased for a 


four spring-amplitude conditions used in 
the present 
were 


study. These functions 
determined empirically for the 
springs involved in lateral control de- 
flections; similar functions hold for all 
directions of stick displacement. For 
purposes of discussion we will consider 
the spring conditions at 8° displacement 
of the control column for Groups 1 and 





Force ( ib.) 











z 8 6 
Displacement ( deg ) 


Fic. 2. The 
encountered by the 


force-displacement relations 


groups during training 
Points indicated refer to typical displacement 


during terminal phases of each trial 


2 and at 2° for Groups 3 and 4. These 
values of AD are typical of the ampli 
tudes of control deflection utilized by § 
during the latter portion of a tracking 
trial. 

At these two values of AD, AF is 8, 2, 
4, and 2 lb., and F is 10, 3, 10, and 4 |b. 
for Groups 1 through 4, respectively. 
Thus, the ratios are .1, .08, .20, and .25 
for Groups 1 through 4. For a 
displacement, AD = 8°, the 
and .08 predict that Group 1 will be 
superior to Group 2, and for AD = 2 
the ratios .20 and .25 predict that Group 
4 will be superior to Group 3 in tracking 
performance. According to  Bahrick, 
Bennett, and Fitts (1), however, pre 
diction of the force 
improving the accuracy of positioning 
movements depends not only upon this 
ratio (relative torque change per unit 
distance), but also upon the absolute 
force change involved in 
flections. 

When S§ applies force to a control, 
Jenkins (8) found that the relative DL 
decreases significantly from 1 to 10 Ib., 
so that within this range the larger 
absolute values of force change are much 


given 
ratios .1 


utility of cues in 


control de 


more useful than smaller values, given a 
particular ratio of relative torque change. 
Thus, in comparing the conditions for 
Groups 1 and 2, Group 1 encountered 
the larger absolute changes in force, and 
sO superior performance for this group 
over that for Group 2 
predicted. 


would again be 


In comparing the force-dis 
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placement conditions for Groups 3 and 
4, however, Group 3 experienced the 
larger absolute change of force, while 
Group 4 encountered the larger relative 
change per unit of displacement, as 
noted above. On the basis of these 
counteracting conditions, then, a clear- 
cut prediction of the relative tracking 
accuracies of Groups 3 and 4 is not 
possible, and the present finding of no 
significant differences between these two 
groups during training (Table 5) is not 
unexpected. 


SUMMARY 


Four groups of male Ss served for 60 training 
and 30 transfer trials in a study designed to 
assess the influence of force and amplitude cues 
from a control column on learning and perform 
ance in a_ two-dimensional 
tracking task, Group | trained with a control 
in which the force-displacement relation was 
known (3) to permit relatively good performance 
with the tracking system. 


compensatory 


Group 2 trained on a 
control which involved one-fourth the total force 
change, but which permitted displacement of 
The 
control column used by Group 3 involved the 
same total force feedback as that for Group 1, 
but amplitudes of displacement were reduced to 


the same amplitudes as for Group 1. 


one-fourth. Finally, Group 4 trained on a 
control which had both force and amplitude cues 
reduced to one-fourth those present for Group 1. 
\fter the 60 training trials all groups practiced 
for an additional 30 trials on the control column 
used by Group I. 

Over the 60 training trials Group 1 was sig 
nificantly superior to the three experimental 
groups with Group 2 being slightly superior to 
Groups 3 and 4. On the first block of transfer 
trials there were no significant differences among 
the four groups, thus indicating near 100% 
transfer for the three experimental groups. 

Analysis of the 


training revealed 


final 
and 


during the 


both 


scores 


trials that force 


amplitude cues significantly affected perform 
ance, amplitude cues apparently exerting the 


greater influence. These performance results 


were shown to be predictable from the ratio 


AF ; 
- , where AF is a force change associated 
F(AD) 
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with a given displacement change, AD, and F 
is the terminal force required to attain the 
displacement. 
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EFFECT OF VARIATION IN ASSOCIATIVE FREQUENCY 
OF STIMULUS AND RESPONSE MEMBERS ON 
PAIRED-ASSOCIATE LEARNING! 


GEORGE MANDLER AND ENID H. CAMPBELL 


Harvard University 


The frequency of associations elic- 
ited by verbal materials has been 
shown by several investigators to 
exert a significant effect upon verbal 
learning. Noble’s index of meaning- 
fulness (m), which is determined by 
the number of associations elicited 
during a l-min. period, has been 
shown to be related to the rate of 
serial learning (7) and the rate of 
acquisition of paired associates (2, 8). 
Similarly, paired-associate learning of 
nonsense-syllable pairs is related to 
the index of associative frequency, 
determined by the number of asso- 
ciations elicited 
period (5). indicate 
that paired-associate acquisition is a 
positive function of the number of 
associations elicited by the stimulus 
and response members of the paired 
associates. 


during a 30-sec. 


These studies 


This paper will present two experi- 


ments in which fre- 
quencies of the stimulus and response 


members of nonsense-syllable paired 


the associative 


associates are varied independently. 
Noble has indicated that the 
ciative frequency variable can be 
related to variations in meaningful- 
ness. Thus, meaningful words show 
a greater frequency of associations 
than nonsense words (6). In an 
unpublished study, Sheffield (9) has 
investigated the effect of two levels 


asso- 


' This research was supported by a grant from 
the Laboratory of Social Relations, Harvard 
The authors wish to thank Dr. 
Robert P. Abelson for his advice on the statistical 
treatment of the data, and Drs. F. D. Sheffield 
and C. E. Noble for other helpful suggestions. 


University. 


of meaningfulness, three-letter words 
and three-letter 
on rate of acquisition in 
associate learning. 


nonsense syllables, 
paired- 
Two of Sheffield’s 
findings are relevant to the present 
study. The first was that learning 
was greatly facilitated by the presence 
of a meaningful word in the response 
position, while in the stimulus posi- 
tion the trend toward facilitation by 
the presence of a meaningful word 
was smaller and nonsignificant. The 
other result indicated that prior 
familiarization with nonsense syllables 
has a definite facilitative effect when 
the syllable is used in the response 
position, but a negative 
effect when it is used in the stimulus 
position. 

The two experiments presented 
below are concerned with the effect on 
paired-associate learning when both 
stimulus and 


transitory 


response members are 
varied on three levels (high, medium, 
and low) of associative frequency, and 
with the effect of prior associative 
activity of S, particularly when he is 
required to produce associations to the 
syllables used in the learning task. 


Metruop 


Design.—Experiments I and II both investi 
gated the acquisition of paired-associate non 
sense syllables. They differ 
lists and dependent measures of 
utilized. All syllables were selected from the 
list of 100 consonant 


whose associate frequency 


in the lengths of 


acquisition 


vowel-consonant syllables 
been 
The values of these 


values have 
published elsewhere (4) 
100 syllables vary from 2.9 to 5.3 associations 
elicited in a 30-sec High, medium, 


and low associative-value syllables were obtained 


interval 
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by starting the selection procedure from the two 
ends of the distribution and on both sides of the 
median. Syllables were included in any one 
list such that for any one pair the same letter 
did not appear in both the stimulus and response 
member, and that no pair of adjoining letters 
appeared in more than one syllable. In addi- 
tion, in Exp. I the initial letter of a stimulus 
syllable did not appear in the initial position in 
any other stimulus syllable. 

In Exp. I three lists (A, B, and C) of six pairs 
were used. The resultant 18 pairs consisted of 
two instances of each of the possible nine 
combinations of high, medium, and low stimulus 
members (S,, S,, S;) and high, medium, and 
low response members (Ri, Rm, Ri). Thus, 
List A contained two pairs each representing 
the Si Ry, Sa = and S; Ry condition. List B 
contained the Sy-Rm, Sm-Re, and the §S)-R, 
conditions. List C represented the Sy-Ru, 
Sm-Ri, and Si;-Ry conditions. One half of all 
the Ss given any one list were presented with an 
association task prior to the learning task, the 
other half were given the association task sub 
sequent to learning. ‘The association task con- 
sisted of 20 nonsense syllables different from 
those used in the learning task, and Ss wrote 
down all the associations elicited by these syl 
lables during a 30-sec. interval. 

In Exp. II one list of 18 pairs was used, two 
pairs for each possible combination of high, 
medium, and low stimulus and response mem- 
bers. The Ss were randomly assigned to three 
groups. One group was given, prior to the 
learning task, an association task which used 
the 36 syllables used in the learning task. A 
second group was given, prior to learning, 36 
syllables different from those used in the learning 
task. The third group was given no prior 
association task. 

‘Table | shows the mean associative frequency 


TABLE 1 
Mean Associative Frequency or Stimutus 
AND Response Terms 


Stimulus Response 


' 
| 
| High | Med. Low High | Med. | 


| 49 | 4.0 | 3.6 
| 49 | 40 | 3.5 


values of the stimulus and response members in 
the two experiments, 

Procedure.—The lists were presented by 
means of a Hull-type memory drum. The 
stimulus member was exposed for 1.5 sec., 
followed by a .5 sec. interval during which both 
shutters were closed, and a 1.5 sec. exposure of 
the response member, with another .5 sec. 
interval during which both shutters were closed. 
Correct responses were scored as anticipations 
of the response member prior to its exposure. 
The lists in Exp. I were presented in six different 
randomized orders, which sequence was repeated 
starting with the seventh presentation, and so 
forth. In Exp. II three randomized orders of 
the list were repeated in a similar manner. The 
Ss in Exp. I continued on the learning task until 
they achieved at least one correct response for 
each pair in the list. In Exp. I], where due to 
the length of the list a criterion of correct re- 
sponses was not feasible, all Ss were given 30 
trials. 

In the association task S was successively 
presented with the nonsense syllables printed 
on 3 X 5-in. filing cards. Each syllable was 
presented for 30 sec., during which time S wrote 
down single word associations. 

Subjects and instructions.—A total of 66 
volunteer paid undergraduate Ss were used. 


TABLE 2 


Means ano SD’s or Trtacs to Firsr Correct Response in Exe 


Mean 

97 
19.0 
17.3 


10.9 15.3 


Stimulus Terms 


SD 

1.3 
10.8 
13.0 


10.5 


Note Data from List A is indicated in normal Arabic numerals, from List B in ilalics, and from List C in 
bold face. The analysis of variance in Table 4 is based on a logarithmic transformation of these data 
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TABLE 3 


Numper or Correct Responses in Expr. Il 


Stimulus Terms 
Response 
Terms 
Sh 


Ry 17.9 
Rim 11.8 13.4 
R, 13.8 8.2 


Total | 145 10.7 


Note :—The analysis of variance in Table 4 was based 
on the frequency of correct response to two paired 
associates representing each S-R combination. In this 
table these values were halved to present mean fre- 
quency for single paired associates. 


Thirty-six Ss participated in Exp. I, 12 Ss for 
each of the three lists; and 18 Ss in each of the 
two association conditions. Thirty Ss partici- 
pated in Exp. II, 10 Ss in each of the three 
association conditions. All Ss 
standard paired-associate 


were given 
instructions, and 
association instructions similar to those used in 
the standardization study (4). ‘The connection 


between the two tasks was not explained to Ss. 


RESULTS 


Tables 2 and 3 show mean values 
of the learning scores for Exp. I and 


II. They indicate the mean value of 
all pairs in the particular stimulus or 
response category, e.g., in Exp. I the 
six pairs with a high frequency 
stimulus member showed an average 
first correct response on Trial 12.9. 
Table 4 summarizes the analyses of 
both experiments. In Exp. I the 
learning variable used was the number 
of the trial in which S gave the first 
correct response for a 
nonsense syllable pair. 
the 
measure was used, 


particular 
For purposes 
logarithm of this 
In Exp. II, the 
dependent measure was the number 
of correct responses given in 30 trials 
(presentations). 
the nine possible stimulus-response 


of analysis 


Since every one of 


combinations was represented by two 
pairs, the values for these parallel 
pairs were combined. Learning 
effects were estimated by using suc- 
cessive blocks of six trials (T). 

The analyses of variance show 
agreement only on the two major 
experimental variables, variation of 
stimulus and of response members. 


TABLE 4 


SUMMARY OF 


Experiment I* 


Source 
Error | 
df Rome MS 
0 | .OR5 
l 195 
1 307 
l 073 
l 
l 
l 


. Groups (G) 

. Stimuli (S) 
Responses (R) 
SxXR 

5 055 
093 
O12 


ee 


1 
l 
1 
I 
l 
l 
l 


NmMNMNNNw Nh 


. | 


Between Ss Error ( | .141 
. Within Ss Error | 029 
.Ssx5S 
. Ss*R 
.-SsxXSX*R 
5. SsxX T 


* In Exp. I lists are confounded with Stimulus K Response interactions 


founding effects of lists removed 
O1 level. 


ANALYSES OF VARIANCE OF 


Exe 


Experiment Il 


Error 
Term 


=~ 


MS 


Anmumvw 


eS he ee & 


NaS OS Se SNR 


All effects are 


reported with the con 


It should be noted, however, that variance due to lista was significant at the 
Gourlay (1) has recently pointed out that in this latin-square type design, such a finding might indicate 


that the variance associated with the Stimuli, Responses, and Stimuli * Responses effects is inflated 
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These two sources of variation are 
significant in both experiments at 
better than the .O1 level. Tables 2 
and 3, however, indicate that while 
response variation in both cases shows 
increasing facilitation as associative 
frequency is increased from low to 
medium to high, stimulus variation 
shows this effect only in Exp. II. In 
xp. I, the S, members are learned 
significantly faster than the S,, mem- 
bers, but not significantly better than 
the S; members. 

Two other significant findings are 
shown in Table 4. Neither of these 
is replicated in both experiments. In 
Exp. I, there is a barely significant 
interaction between Response vari- 
ation and Association groups. This 


significant interaction is associated 


with faster learning of the R,, and 
R; pairs when learning is preceded 
by the association task, with no such 
difference for the Ry pairs. 

The other finding, in Exp. II, shows 
a significant Stimulus-Response inter- 


This interaction effect is 
apparently primarily effected by three 
of the stimulus-response combina- 
tions. It is apparent from Table 3 
that the S,-R; pairing produced 
unusually fast acquisition, particu- 
larly when compared with the com- 
bination of the R; member with §S,, 
and S; members. 


action. 


This rapid acquisition of S,-R; pairs 
is in contrast with an incidental datum 
from Kimble and Dufort (2). These 
investigators found that lists using 
Noble’s disyllables and three-letter words 
were more easily learned when the less 
meaningful disyllables were in the 
stimulus position and the three-letter 
words in the response position than when 
this relationship was reversed. Simi- 
larly Sheffield’s data show that pairs 
with nonsense syllables as stimuli and 
meaningful responses are 
learned faster than pairs with meaningful 
words in the stimulus position and 


words as 
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nonsense syllables in the response posi- 
tion (7). These findings would imply 
in the present study that the S)-R, 
pairs should be learned faster than the 
Sn-R; pairs; however, the opposite is the 
case.2- One possible explanation of these 
discrepant findings is that in both the 
Kimble and Dufort and the Sheffield 
studies, the Ss had to learn “how to 
respond” when the more difficult (less 
meaningful) items were in the response 
position. Both the majority of the 
disyllables used by Kimble and Dufort 
and the nonsense syllables used by 
Sheffield required S to learn some novel 
combinations of sounds, i.e., introduce a 
selective effect of response learning (3). 
This requirement might increase acqui- 
sition time for nonsense responses as 
against meaningful, previously practiced, 
responses. In the present study this 
learning of the response was required in 
all conditions. Thus the saving from 
previous experience would not be a 
factor which would adequately facilitate 
the acquisition of more meaningful 
responses. 

The other two values contributing to 
the significant S * R interaction are the 
Si-Ry, pairing which shows a somewhat 
lower rate of acquisition than expected, 
and the S,,-R,, pairing which shows a 
somewhat high rate of acquisition. In 
this connection it should be pointed out 
that the S,-Ry, Sm-R», and S)-R; pairings 
show the expected decreasing rate of 
learning. This is consistent with the 
previous research (2, 5, 8). 

None of the results indicates any 
simple effects on learning as a function 
of previous exposure to the learning 
material. Neither the presentation of 
identical material in a prior association 
task, nor an association task with 
different materials shows any, even 
suggestive, effect on rate of learning in 
these two experiments. 

The present results indicate that 
associative frequency is an important 


2 It should be noted that in Exp. I, where the 
Stimulus-Response interaction shows a level of 
significance of only .10, the data are in the 
direction of the Kimble and Dufort findings. 
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factor in paired-associative learning. 
This variation is effective in both the 
stimulus and the response members of 
nonsense syllable pairs., However, only 
in the response member was it a con- 
sistent function of associative frequency 
in both experiments. Variation of the 
stimulus member showed a similar effect 
in Exp. II, but in Exp. I no such simple 
function appears. This may have been 
partly due to confounding effects in the 
design of Exp. I. This discrepancy is 
particularly important in view of Shef- 
field’s previous findings of a consistent 
effect of the variation of meaningfulness 
on the response side, but not on the 
stimulus side. However, it should be 
noted that Sheffield’s criterion of mean- 
ingfulness was the presence of the word 
in a dictionary. Thus his meaningful 
items had a high degree of familiarity 
which is absent for the syllables in the 
present studies. 

The failure to find any consistent 
significant effects associated with prior 
associative activity on the part of the Ss 
must be contrasted with Sheffield’s 


findings of a significant facilitative effect 


of extensive familiarization when a 
nonsense syllable is used in the response 
position. The 30-sec. association period 
may have been quite insufficient to pro- 
vide § with adequate experience with 
the nonsense syllable. 


SUMMARY 


Two experiments were performed in which Ss 
learned nonsense-syllable paired associates. 
The pairs in both experiments represented three 
degrees of associative frequency (high, medium, 
and low) in both the stimulus and the response 
members. In addition, different groups of Ss 
were given different association tasks prior to 
the learning task. The main findings were: 

1. Variation in associative frequency of the 
response member showed increasing facilitation 
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in learning as a function of increasing degrees of 
associative frequency. 

2. While variation of associative frequency 
of the stimulus members also showed a significant 
effect on acquisition, this effect was not con- 
sistently in the direction of a simple association 
between associative frequency and learning. 

3. The presence or absence of prior asso- 
ciation tasks did not significantly affect acqui- 
sition in the learning task. 


REFERENCES 


. Gourtay, N. 
design of the latin square type. 
metrika, 1955, 20, 273-287. 

. Kiwece, G. A., & Durort, R. H. Meaning- 
fulness and isolation as factors in verbal 
learning. J. exp. Psychol., 1955, 80, 
361 368. 

. Manpuer, G. 
learning. 
244. 

. Manoier, G. Associative frequency and 
associative prepotency as measures of 
response to nonsense syllables. Amer. J. 
Psychol., 1955, 68, 662-665. 

. Mannuer, G., & Hurrentocuer, J. The 
relationship between associative fre- 
quency, associative ability and paired- 


Amer. J. Psychol., 


F-test bias for experimental 
Psycho- 


Response factors in human 


Psychol. Reo., 1954, 61, 235- 


associate learning. 
1956, 69, 424-428. 

. Nopie, C. E. An analysis of meaning. 
Psychol. Rev., 1952, 59, 421-430. 

. Nosre, C. E. The role of stimulus meaning 
(m) in serial verbal learning. J. exp. 
Psychol., 1952, 43, 437-446; Erratum, 
1952, 44, 465. 

. Nosie, C. E.,& McNeery, D. A. The role 
of meaningfulness (m) in paired-associate 
learning. J. exp. Psychol., 1957, $3, 
16-23. 

. Suerrietp, F. D. The role of meaningful- 
ness of stimulus and response in verbal 
learning. Unpublished doctor's 
sertation, Yale Univer., 1946. 


dis- 


(Received September 28, 1956) 





Journal of Experimental Psychology 
Vol. 54, No. 4, 1957 


STUDIES OF DISTRIBUTED PRACTICE: XVII. 


INTERLIST 


INTERFERENCE AND THE RETENTION OF PAIRED 
CONSONANT SYLLABLES !? 


BENTON J. UNDERWOOD AND JACK RICHARDSON 


Northwestern University 


A previous study (13) demon- 
strated that retention of serial con- 
sonant syllable lists was inversely 
related to the number of lists learned 
prior to the acquisition of the list 
tested for retention. Fu:thermore, 
in that study there was an interaction 
between intertrial interval and the 
number of previous lists learned, 
namely, massed practice of the first- 
learned list gave better retention than 
did distributed practice but just the 
reverse was true for the seventh list 
in the series. On the assumption that 
as number of previous lists increase 
interlist interference increases, the 


results indicated that massed practice 


facilitates retention when _ interlist 
interference is low and distributed 
practice facilitates when this inter- 
ference is high. 

It is commonly found that retention 
of paired-associate lists is better 
following distributed practice than 
following massed practice (4, 7, 8) 
but the differences are not usually 
statistically significant. In_ these 
studies S served in counterbalanced 
conditions and learned a total of four 
lists so there was some possibility for 
interlist interference. In comparable 
experiments with serial lists (5, 6, 10) 
massed practice was followed by 
better retention than was distributed 
practice. Other analyses (11) sug- 


1R. W. Schultz supervised the gathering of 
the data. 

? This work was done under Contract N7onr 
45008, Project NR 154-057, between North- 
western University and the Office of Naval 
Research. 


gest that interlist interference effects 
are greater for paired-associate lists 
than for serial lists. 

The above facts suggest the possi- 
bility that whether or not distributed 
practice facilitates retention is due 
entirely to amount of interlist inter- 
ference and that the type of learning 
involved is not a critical variable. 
Moreover, these facts suggest that the 
“necessary” amount of interlist inter- 
ference needed to produce signifi- 
cantly better retention following 
distributed practice of paired-asso- 
ciate lists would be present with 
fewer lists than in the case of serial 
learning. ‘To obtain data bearing on 
this issue the amount of interlist 
interference is varied in the present 
experiment by varying the number of 
lists learned previous to the learning 
of a list by distributed practice. At 
one extreme there are six previous 
lists (thus duplicating the serial 
experiment), at the other, one. In 
addition, the lists have been con- 
structed in the same the 
serial that formal 
similarity is high. 


way as 


lists so interlist 


Metuop 


Lists.—Each of seven paired-associate lists 
was constructed of four pairs of consonant 
syllables taken from Melton (2). All syllables 
were between 50% and 75% association value 
as calibrated by Witmer (14). The 56 syllables 
contained a total of nine different consonants 
Fach of the nine consonants appeared from one 
to four times in each list. The lists were pre 
sented on a Hull-type memory drum at a 2: 2-sec. 
rate and learning was by the anticipation method 
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in which S spelled the syllable. Six orders of 
presentation were used to prevent serial learning. 

Design.—A total of 336 
as Ss. These were divided six 
groups of 56each. ‘Two of these groups learned 
a total of 7 lists, two learned 4 lists, and two 
learned 2 lists. 


undergraduates 


served into 


For each number of lists there 
was a massed learning group and a distributed 
learning group. The procedure for the 7-list 
groups parallels that of the previous serial 
experiment. The distributed group learned the 
first list with a intertrial interval. 
Twenty-four hours later this list was recalled 
and relearned and then two additional lists were 
learned by 
interval). 


30-sec. 


massed practice (4-sec. intertrial 
On the third day three more lists 
were learned by massed practice. On the fourth 
day a single list was learned with a 0-sec, 
intertrial interval, and 24-hr. later this list was 
recalled and The 
massed group went through exactly the same 
schedule except that all lists were learned by 
massed practice. 


relearned. corresponding 


The 4list groups proceeded the same as the 
7-list groups through the first two days. On the 
third day a single list was learned; this list was 
then recalled and relearned 24 hr. later. One 
of the groups learned all four lists by massed 
practice, the other learned the first list with a 
30-sec. intertrial interval and the fourth with a 
()-sec. interval, with the two “‘filler’’ lists being 
learned by massed practice. 

The 2-list groups learned and recalled the 
first list as did the other groups. However, on 
the second day, after relearning the first list, a 
second list was learned which was recalled 24 hr. 
later. One group learned both lists by massed 
practice, the other with the 30-sec. and (&-sec. 
intertrial intervals for the first and second list, 
respectively. 

The six groups will be designated by a number 
and a letter, the number designating the total 
number of lists learned and the letter whether 
the first and last lists were learned by massed or 
distributed Thus, the 4-D group 
learned four lists and the learning of the first 
and fourth list was with a 30-sec. 
intertrial interval, respectively. The six groups 
were 2-D, 2-M, 4-D, 4-M, 7-D, and 7-M. 

The groups which learned different numbers 
of lists did not learn the same lists. 


practice. 


and (-sec. 


The four 
lists were selected from the seven and the two 
from the four. Within each number of lists the 
different lists were systematically varied so each 
list appeared equally often at each stage of 
Standard for paired- 
associate anticipation learning were given prior 
to the learning of the first list and at relearning 
it was emphasized that S should anticipate as 
many syllables as possible on the very first trial. 


practice. instructions 


- 


OLE 
3 


The intertrial intervals longer than 4 sec. were 
filled with symbol cancellation (5 All learning 
and relearning was to a criterion of one perfect 


trial. Relearning was by massed practice for all 


groups for all lists. 


RESULTS 
Learning.-The mean number of 
trials required to learn each list to the 
criterion of one perfect trial is shown 
in Fig. 1. The graph is drawn as if 
it represents only the groups 
While one curve 
the 


be 


two 
learning seven lists. 
is labeled “ 
“distributed” it should remem- 
bered that this difference in conditions 
holds only for Lists 1 and 7; Lists 2 
through 6 were learned by 
practice. 


massed” and other 


massed 
The points in Fig. 1 are 
based on all data available for a given 
list in the series; thus N decreases as 
the 2- and #Hlist groups drop out. 
For the massed curve the N is 168, 
168, 112, 112, 56, 56, and 56 for Lists 
] through 7, respectively. The cor- 
responding N’s for the distributed 
curve are 168, 112, 112, 56, 56, 56, 
and 56. The learning of List 2 by 
Group 2-D and List 4 by Group 4-D 
are shown by separate points. 

It may noted first that the 
change in performance within days 
and between days for Lists 2-6 show 
a remarkable correspondence with the 


be 


results from the previous serial ex- 


periment (13, Fig. 1). There is no 
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way in the present data of deter- 
mining how much of these changes 
can be attributed to each of various 
phenomena, such as warm-up, prac- 
tice effects, and interference. As far 
as learning is concerned, the major 
interest is in the apparent consistently 
faster learning with distributed prac- 
tice than with massed practice. The 
mean number of trials required to 
learn Lists 1, 2, 4, and 7 by distributed 
practice is in each case less than the 
number of trials required to learn the 
corresponding massed list. However, 
it is possible that these differences 
may result, at least in part, from the 
selection of Ss by the conditions of 
the experiment. Although the lists 
had only four pairs of syllables, they 
were very difficult to learn; 135 Ss 
were dropped from the experiment, 
of which 60 were dropped for failure 
to learn within a 50-min. period. The 
remainder was lost for miscellaneous 
reasons such as failure to return for 
recall. The number of Ss dropped 
increased as the total number of lists 
to be learned increased and was 
greater for the distributed conditions 
than for the massed. Group 2-M had 
the fewest drops, Group 7-D the most. 
The present experiment was de- 
signed to test the interaction between 
number of previous lists and intertrial 
interval on retention and equality of 
groups is not essential for the measure 
of retention to be used. Neverthe- 
less, since the facilitation of learning 
of paired associates by distributed 
practice is rather rare, it becomes 
important to determine whether or 
not all of the differences between 
massed and distributed learning can 
be attributed to the selection of Ss. 
Within the experiment there are at 
least three tests of the hypothesis that 
learning is not facilitated by distri- 
bution of practice and each of these is 
biased in favor of the hypothesis. 
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1. Two groups, 4-D and 7-D, learned List |! 
by distributed practice and List 2 by massed 
practice. Both practice effects and warm-up 
would tend to make List 2 learned more rapidly 
if distributed practice did not facilitate the 
learning of the first list. Quite the opposite is 
true. The mean number of trials required to 
learn List 1 (distributed practice) by the 4-D 
and 7-D groups combined was 23.12 + 1.00, and 
was 29.25 + 1.47 for List 2 (massed practice). 
The difference between these two means is 
highly significant, giving a critical ratio of 3.46. 
Thus it appears that for the same Ss, where no 
selection could enter, distributed practice pro- 
duced faster learning. 

2. The other two tests of the effect of distri- 
bution of practice on learning consist of com- 
paring Group 2-D and combined Groups 4-D 
and 7-D on List 2, and comparing 4-D and 7-D 
on List 4. In both cases the distributed learning 
is compared with the massed learning of a group 
which has been more highly selected and this 
should reduce the apparent facilitation by distri- 
bution. For example, the 4-D and 7-D groups 
lost more Ss than did 2-D. ‘Thus, the selection 
of Ss should result in faster-learning Ss being in 
4-D and 7-D than in 2-D. For Groups 4D and 
7-D List 2 was learned by massed practice; yet, 
a mean of 29.25 + 1.29 trials was required by 
these groups to learn this list whereas Group 
2-D, learning List 2 by distributed practice, 
required only 20.73 + 1.29 trials. The differ- 
ence between these means gives a critical ratio 
of 4.37. 

3. Using the same logic as above, the 
learning of List 4 by Group 4D (distributed 
learning) can be compared with the learning of 
List 4 by Group 7-D (massed learning). ‘The 
respective means are 22.04+ 1.41 and 24.30 
+ 1.58. The difference between these means is 
not significant but its direction favors distributed 
practice. 


In two out of the three comparisons 
made, distributed practice gave sig- 
nificantly faster learning than massed 
practice even when the tests made 


were biased against such results. 
Thus, while there is no doubt a 
selection of Ss, involved, distributed 
learning appears to contribute to 
differences in speed of learning above 
and beyond this selection. To at- 
tribute all the difference between 
massed and distributed learning to 
selection of Ss would require a selec- 
tion of Ss on the first list who were 
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facilitated by distribution (the 
“drops” not being facilitated) and 
whose performance under distributed 
learning was related to subsequent 
distributed learning but not to subse- 
quent massed learning. Internal 
analysis (too detailed to report here) 
do not support this possibility. It 
may be concluded that it is highly 
probable that distributed practice 
facilitated the learning of these paired- 
associate consonant lists. 

The mean number of trials required 
to attain successive criteria on the 
first and last lists is plotted in Fig. 2. 
As in the serial experiment (13), the 
distributed groups are superior at all 
points during the learning of the first 
list but on the last list the massed 
practice groups are superior early in 
learning and then change to inferior 
performance. Figure 2 shows the 
acquisition of the combined groups 
on the last list learned. Plotting the 
acquisition of the last list for each 
stage of practice shows the same 
interaction in stage of learning and 
distribution. 

Retention.—Retention of the first 
and last list was tested 24 hr. after 
learning. The 168 Ss who learned 
the first list by massed practice re- 
called a mean of 1.92 + .073 items 
while the 168 Ss who learned the 
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TABLE 1 


Retention or THe Last List Wuen Learnep 
BY Massep Practice ano Distrisutep 
Practice AND Precepep sy |, 

3, ann 6 Lists 


: | 
Mean Number Mean Loss 
| 


Scores** 


uagees Recalled* 


Massed Dist. 


| 
1.25 | | 


Previous 
Lis 


| 
assed Dist 


1.36 1.41 
1.74 1.87 
1.92 


82 


75 2.06 





* Estimated oy, = .125. 
* Estimated o.,, = .123. 


first list with a 30-sec. intertrial 
interval recalled a mean of 1.97 
+ .076 items. This difference does 
not approach statistical significance. 
The mean number of items recalled 
from the last list after 24 hr. are 
presented in Table 1. The recall 
decreases as the number of previous 
lists increases. ‘The tendency for the 
distributed group to show better 
recall is to be expected from the more 
rapid learning. 

An analysis which attempts to 
control for the differences in degree 
of learning between groups which 
have learned to the same criterion at 
different speeds has been presented 
elsewhere (3,12). From a successive 
probability analysis of learning a 
prediction of the number of correct 
responses for each S if he had been 
given one more trial during acqui- 
sition is computed. The observed 
recall after 24 hr. is subtracted from 
the predicted score. These loss scores 
were computed for each group on the 
first and last lists. The mean loss 
scores for List 1 of the massed and 
distributed groups were .88 + .077 
and .97 + .079, respectively. This 
difference is not significant (C.R. 
<1). The mean loss scores for the 
last list are presented in Table 1. 
Again there is a tendency for there 
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to be less loss in the massed groups, 
but an analysis of variance of the 
loss scores gives an F of only 1.01. 
An F of 12.64 (2 and 330 df) indicates 
that the number of previous lists 
learned is a significant variable in 
retention. The greater the number 
of lists learned the greater the loss 
over the 24-hr. retention interval. 
The relearning scores show the same 
picture as do the recall scores. There 
is no indication of an interaction be- 
tween the number of previous lists 
and the intertrial interval during 
learning. 


Discussion 


Hovland (1) found that distribution 
of practice facilitated the learning of 
paired-associate lists, but other studies 
(4, 7, 8) using a 2:2-sec. rate of presen- 
tation have found no effect of distribu- 
tion on learning paired-associate lists. 
The material used in the present experi- 
ment was consonant syllables of from 50 
to 75% association value and the lists 
had high intralist similarity. With 
serial lists, where facilitation of learning 
by distributed practice is the rule rather 
than the exception, there is no indication 
that intralist similarity (6, 9) interacts 
with distribution of practice. In the 
same type of lists distribution is more 
effective with the low-meaningful ma- 
terial. Therefore it is concluded that the 
facilitation of learning of paired asso- 
ciates by distributed practice found in 
this study is due to the low meaningful- 
ness of the consonant syllables. 

This study has shown that increased 
interference from previous learning re- 
sults in decreased retention, but there is 
no evidence in the retention data that the 
intertrial interval during learning inter- 
acts with the interference. There were 
only two levels of distribution, but a 
supplementary study using a 2-min. 
interval on the seventh list showed no 
better recall than after a 4- or 60-sec. 
interval. It does not seem likely that a 
longer intertrial interval would change 
the results. Since there is a decrement 
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in retention related directly to number 
of previous lists learned comparable to 
that found in the serial study (13), it 
also seems reasonable to conclude that 
the lack of interaction at recall between 
intertrial and number of previous lists 
was not the result of a lack of inter- 
ference from previous lists. 

The trials-to-criteria curves (Fig. 2) 
suggest an interaction between intertrial 
interval and interference during learning. 
However, the hypothesis that this inter- 
action will also be reflected in the recall 
scores has received no support from the 
present study. From the results of the 
previous serial experiment (13) it was 
suggested that when interlist interference 
is high, a differentiation of the list being 
learned from previously learned lists is 
more effectively accomplished by dis- 
tributed practice than by massed prac- 
tice. This differentiation process is 
suggested by the initially poorer per- 
formance under distribution (than under 
massed) during early learning trials when 
interlist interference is high. This con- 
ception may still be adequate for the 
rate of loss of such differentiation over 


time is not known but would be expected 
to be lost more rapidly with paired- 
associate lists than with serial lists. A 
shorter retention interval for the present 
experiment might give the expected dif- 
ferences in recall. 


SUMMARY 


This experiment studied the learning and 
retention of paired consonant syllables as a 
function of intertrial interval and interlist 
interference. Interlist interference was manipu- 
lated by varying the number of highly similar 
lists used previously. There were six groups of 
56 Ss each. Three distributed-practice groups 
learned the first list with a 30-sec. intertrial 
interval and the last with a 60-sec. interval. 
Three massed-practice groups learned all lists 
with a 4-sec. intertrial interval. Both a massed 
and a distributed group learned a last list after 
this list was preceded by 1, 3, or 6 highly 
interfering lists. All lists were learned to one 
perfect trial with recall occurring after 24 hr. 

The results show that distributed practice 
facilitated learning. ‘This is attributed to the 
low meaningfulness of the material as compared 
with material used in previous experiments 
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where such facilitation was not found. Re- 
tention decreased consistently as the number of 
previous lists learned increased but no difference 
in retention was observed as a function of 
intertrial interval during learning. 
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SECONDARY EXTINCTION OF LEVER-PRESSING 
BEHAVIOR IN THE ALBINO RAT! 


WILLIAM W. ROZEBOOM * 
University of Chicago 


Interest has recently been revived in 
the problem of latent, or secondary 
extinction,’ a problem which may be 
formulated: If an instrumental re- 
sponse (R) has been conditioned to a 
stimulus (S,) by following the occur- 
rence of R in the presence of S, with 
a reinforcing stimulus (S*), and if the 
reinforcement value of S* is subse- 
quently modified, will the response 
to S, remain unaffected or will it have 
altered in accordance with the new 
reinforcement value of St? Existence 
of secondary extinction would seem 
to imply that instrumental behavior 
involves a mediation process whose 
behavioral effects are governed by 
those of the reinforcer through which 
the behavior is established (11). 


!The present data are taken from a dis- 
sertation submitted to the Faculty of the 
Division of the Biological Sciences, the Uni- 
versity of Chicago, in partial fulfillment of the 


requirements for the Ph.D. degree. ‘The writer 
wishes to express his appreciation for the 
valuable assistance of his sponsor, Dr. Howard 
F. Hunt, and for the generous support of the 
National Science Foundation, whose fellowship 
prograin has subsidized the writer’s graduate 
studies. This research was also supported in 
part by the Medical Research and Development 
Board, Office of the Surgeon General, Depart- 
ment of the Army, under Contract No. DA-49- 
007-MD-291. 

9 Now at the University of Minnesota. 

3 The title, “nonresponse-extinction” has been 
proposed by Deese (4) for this design. How- 
ever, this is also appropriate to extinction 
procedures (e.g., drug-induced response block- 
age) considerably different from the present 
paradigm. Nor is the term “latent extinction” 
quite appropriate here, since there is no period 
during which the extinction is latent in the usual 
sense of the word. The term “secondary ex- 
tinction” is here offered in analogy to “secondary 
generalization,” a phenomenon of similar 
theoretical import (see 11). 


Other than Miller’s (9) demonstration that 
shocking rats placed directly in a distinctive 
goal box tended to suppress a running response 
which had previously terminated in that goal 
box, evidence either for or against secondary 
extinction is surprisingly meager. Seward and 
Levy (12) and Deese (4) have reported positive 
findings, but both experiments are complicated 
by a differential secondary reinforcement of 
experimental and control groups (2, 10), an 
effect whose importance has clearly been shown 
by Moltz (10), who was unable to demonstrate 
positive results in its absence, and also more 
recently by Coate (3). Negative findings have 
been reported by Bugelski, Coyer, and Rogers 
(2) and by Scharlock (13), but since these 
experiments also failed to show a secondary 
reinforcement differential which should have 
appeared, their significance is problematic. 


The possibility of secondary ex- 
tinction is here explored in thirst- 
motivated lever pressing of the albino 
rat. Such behavior, aperiodically re- 
inforced by water reward, may some- 
what arbitrarily be analyzed as a 
response chain of three main links: 
(a) lever pressing, (b) approach to the 
water cup, and (c) drinking. Stimuli 
accompanying water delivery (“maga- 
zine stimuli”) serve to reinforce lever 
responses and provide discriminative 
cues for approach to the water cup 
(see 5). The question then arises 
whether, after the lever habit is 
established, altering the secondary 
reinforcement value of the magazine 
stimuli affects the strength of the 
lever habit prior to culmination of the 
response in the modified secondary 
reinforcement. The reinforcement 
properties of magazine stimuli may 
be altered, while levers are absent, in 
at least two ways (“magazine recon- 
ditioning”’): (a) extinction of positive 
value through repeated activations of 
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the dry water magazines (Exp. I); or 
(b) endowment with secondary aver- 
sive properties by pairings of maga- 
zine stimuli with shock (Exp. II). 
The extent, if any, to which magazine 
reconditioning has weakened lever 
responding may then be determined 
by replacing the levers and com- 
paring experimental and _ control 
groups for unreinforced lever presses 
(“lever extinction”). Whether maga- 
zine reconditioning was, in fact, 
accomplished may subsequently be 
tested by reconnecting the dry water 
magazines to the levers. Comparison 
of additional lever presses (“maga- 
zine extinction”) reveals the residual 
reinforcement properties of the maga- 
zine stimuli. 


EXPERIMENT | 


Method 


Subjects —The Ss were male albino rats of 
the Sprague-Dawley strain. All were about 


100 days old and experimentally naive at the 
beginning of the experiment. 


Forty Ss were 
given initial training, from which two matched 
groups of 16 Ss each were selected at the be- 
ginning of magazine reconditioning. 
Maintenance and running schedule.—The Ss 
were run daily for a 20-min. recorded interval, 
plus about 14 min. both before and after the 
interval recorded. They were always run under 
a 23-hr. water deprivation, water being received 
once a day for a period of 25 min. beginning 25 
min. after return to the home cage. Dry food 
pellets were always available in the home cage. 
Apparatus.—The Skinner boxes were a bat- 
tery of eight in regular use at the University of 
Chicago. These boxes are sound and light 
insulated, with electrifiable grill floors. The 
water delivery mechanism consists of a small 
cup which dips into an underlying reservoir 
when activated. A single horizontal T lever is 
located above and 3 in. to the right of the water 
cup. Throughout the present experiment, rein- 
forcement was controlled through a timing 
device which makes reinforcement available on 
a prearranged semi-random interval schedule. 
For the rates at which lever pressing stabilized, 
the schedule used furnished an average of one 
reinforcement for seven lever presses. Acti- 
vation of the water magazine produced two 
prominent clicks, over-all duration of which was 
approximately .5 sec. In addition, a small, dim, 


EXTINCTION 281 


neon bulb to the left of the water cup flashed on 
for .1 sec. at the beginning of magazine acti- 
vation. 

Pretraining.—F or the first six days of train- 
ing, Ss were run with disconnected levers to 
familiarize them with the apparatus and provide 
a record of unconditioned operant lever-pressing 
level. The latter variable showed little or no 
relation to subsequent behavior measures. 

Conditioning.—Three types of training were 
given during conditioning. (a) On 20 “lever 
conditioning” days (L), levers were connected 
to the full water magazines, with reinforcement 
available on the schedule reported. (b) On each 
of eight “magazine conditioning” days (M), 
levers were removed and magazines were acti- 
vated by E£, 20 activations per S on each of the 
first four days and 10 per S on each of the last 
four. The purpose of M was to establish and 
maintain the secondary reinforcement value of 
magazine stimuli independently of lever pres- 
ence. (c) On five “adaptation” days (A), 
levers were removed and no reinforcements were 
given. This was to sharpen discrimination of 
lever stimuli as the essential S”’s for lever 
pressing. The sequential order of training was: 
4M, 4L, M, 4L, M, 2L, 5A, 2L, M, 4L, M, 4L. 
An average response rate of around 200 lever 
responses per run was achieved by the fifth L 
day, gradually increasing to a daily mean of 
250 presses, averaged over the last four days. 
There was little inter-S variation in total number 
of reinforcements, and later statistical analysis 
revealed no correlation between this variable 
and other behavior measures. 

Magazine reconditioning.—After Day 20 of 
lever an experimental and a 
control group of 16 Ss each were selected, closely 
matched for terminal response output, as meas 
ured by the last four days of conditioning. With 
levers removed, the experimental group then 
received a total of 400 activations of the dry 
magazine, distributed over 25 runs in the se 
quence: 40, 40, 0, 40, 40, 0, 0, 20, 20, six days 
not run, 40, 20, 20, 0, 20, 20, 0, 0, 20, 20, 0, 0, 0, 
0, 20, 20. Spacing of extinction was designed 
to encourage spontaneous recovery and subse 
quent re-extinction. Control Ss were treated 
during magazine extinction exactly as were the 
experimentals except that controls received no 
magazine activations and hence no extinction of 
the positive secondary reinforcement value of 
Magazine stimuli. 

Lever extinction.—Both groups were then run 
for 16 days with levers replaced but disconnected 
from the water magazines. By the end of this 
time, response rates had returned to their 
original unconditioned operant level. 

Magazine extinction.—All Ss received a 
terminal eight runs wherein levers were recon- 


conditioning, 
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nected to the dry magazines, with magazine 
stimuli contingent upon lever pressing according 
to the same schedule encountered during con- 
ditioning. 


Results and Discussion 


Examination of Fig. 1 reveals the 
course of response output during lever 
extinction to be practically identical 
for both groups, with the experi- 
mentals, contrary to hypothesis, 
actually showing a greater total 
output than controls. (The un- 
usually large difference on Day 14 
was primarily the contribution of a 
single S, who responded in a highly 
atypical manner that day.) The 
only evidence remotely favoring the 
existence of a stronger response tend- 
ency in the control group is its 
slightly greater output for the first 
two days. To test the significance 
of this, the sum of the first two days’ 
output was subjected to an analysis 
of covariance (14, p. 378 ff.), even 
though post hoc selection of variables 
in this way biases statistical tests 
toward spurious significance. When 
the regression of the first two days’ 
lever extinction output on terminal 
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during “lever extinction” in Exp. I. 


ROZEBOOM 





° 
° 


3S 
° 


$ 
} 


————oe EXPERIMENTAL 


MEAN CUMULATIVE RESPONSES 


-— —«# CONTROL 











Fic. 2. Cumulative mean lever responses 
during “magazine extinction” in Exp. I. 


conditioning rate is accounted for 
(r = .54), the ratio of between-group 
to within-group variance is F = .96, 
which occurs under the null-hypothe- 
sis with probability P = .36 (1 and 
29 df). During magazine extinction 
(Fig. 2), however, a consistent dif- 
ference appears. Analysis of covari- 
ance applied to the first day’s output 
gives F = 7.00 (P = Ol, 1 and 29 
df). Thus, while there is no de- 
tectable difference between experi- 
mental and control groups in lever 
extinction outputs, reintroduction of 
magazine stimuli contingent upon 
lever pressing reveals clearly a greater 
residual reinforcement value of maga- 
zine stimuli for control Ss. 


It would appear that under the param- 
eters of this experiment, at least, partial 
extinction of a secondary reinforcer with 
whose aid an instrumental response has 
been established has no effect on the 


strength of the habit. It might be 
protested that the secondary reinforcers 
were insufficiently extinguished for the 
experimental group to display a detect- 
ably weakened habit, for magazine 
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TABLE 1 


Measures or Perrormance, Expertments I anv II 


Terminal 


First 2 Days Lever 
Cond, Rate 


Extinction 
Group 


| 


| 
j - | mY j | | 
| Mean SD Mean*; SD _ (Adj. SDt| Mean* 
| ! ! 


First Day Magazine 


Extinction Extinction 


Total Lever | 


% j 
SD |Adj. SDt|Mean*) SD Adj. SDt 


| 


Experiment I 


53.6 | 
68.4 | 


63.1 
62.9 


| j 
22 8 | 80.7 | 107.8 | 
16 4} 76.0 | 121.4 | 
16 8 | 82.7 | 108.5 


37.6, 


| 87.8 84.2 
93,9 


272.4| 64.8) 61.3 


} 231.1 101.8 | 83.6 73.5 


| 


* Since groups were matched for terminal conditioning rate, adjusted means differ only insignificantly from 


observed means 


1 Best estimate of parametric SD after removal of variance component attributable to terminal conditioning rate. 


reconditioning produced a reinforcement 
loss of only about 25% (based on total 
responses during magazine extinction 
less an operant level base line of 10 
responses per day), even though Beck- 
with’s data (1) suggest that less than 
half this number of dry magazine acti- 
vations following lever responses would 
have been ample for complete extinction. 
The fact remains, however, that the 
reinforcement difference, though small, 
was still large enough to be clearly 
evident statistically. There is no ap- 
parent reflection of this difference during 
lever extinction, and it must be con- 
cluded that the strength of a lever habit 
is at best quite insensitive to changes in 
reinforcement value of the stimuli by 
which the habit was initially reinforced, 
at least under the present parameters of 
conditioning. 


EXPERIMENT II 


Experiment II was conducted in 
the hope that the behavioral effect of 
a secondary reinforcer could be altered 
more radically by pairings with a 
primary aversive stimulus, and hence 
more likely reveal secondary extinc- 
tion if this phenomenon exists. 


Method 


The Ss were similar to those of 
Exp. I, a total of 54 being used. No Ss were 
lost or discarded during the experiment. 
Apparatus, 
schedule.—Identical to Exp. I. 
Pretraining and conditioning 


Subjects. 


maintenance, and running 
Treatment of 
Ss in Exp. II for these phases was, with three 
minor exceptions, identical to that of Exp. I: 
(a) Ss were given only four pretraining runs; 
(b) only three days of magazine conditioning 
were given prior to the first day of lever condi 
tioning; and (c) to facilitate later 
on the four magazine 
conditioning days, all Ss also received 10 pres- 
entations of the later 
Group C, during magazine reconditioning. 


discrimi 


nations, each of last 


tone experienced by 
The 
mean daily response output over the last four 
days (terminal conditioning rate) was 244, very 
similar to the corresponding rate in Exp. I. 

Magazine reconditioning.—The Ss were then 
divided into three groups closely matched for 
terminal an experimental 
group (FE) of 22 rats, and two control groups 
(Cy, Cz) of 16 rats each. As in Exp. I, levers 
were throughout 
tioning, and the water magazines were always 
dry. 

During the first four days, each S in Group 
E received 120 presentations of the dry water 
magazine, 40 on each of the first two days and 
20 on each of the others. 
were run as usual, but received no magazine 
stimuli. During the next 22 runs, each S of 


conditioning rate; 


absent magazine recondi 


Groups C, and C; 
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every group received a total of 40 electric shocks. 
The number of shocks varied from day to day 
according to the sequence: 3, 2, 4, five days not 
run, 3, 1, 2, 4, 0, 0, 4, 2, 0, 3, 0, 1, 2, 1, 0, 0, 4, 
4,0. The shock circuit was set to deliver an 
alternating current of approximately 1.0 ma. 
for .2 sec., automatically timed. For Group E, 
each shock was preceded by activation of the 
dry water magazine, with an automatically 
timed interval of .5 sec. between cessation of 
magazine stimuli and onset of shock. Group 
C, had each shock preceded by a medium in- 
tensity, 1000-cps tone, of .5 sec. duration and 
with an interval of .5 sec. between tone offset 
and shock onset. Group C, received shock 
alone, unheralded by any warning signal. 

Lever and magazine extinction.—As in Exp. I, 
all Ss were now run with levers replaced, but 
disconnected. When lever extinction outputs 
appeared asymptotically equal for all groups, 
the dry magazines were reintroduced for seven 
days of magazine extinction. 


Results and Discussion 


Although the response outputs of 
Groups E and C, during lever extinc- 
tion can scarcely be distinguished in 
Fig. 3, the record of C, shows a 
distinct superiority. To test the sta- 
tistical significance of this difference, 
two preselected measures—sum of 
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during “lever extinction in Exp. II. 
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responses for the first two days and 
total lever extinction presses—were 
separately subjected to an analysis 
of covariance in order to control for 


‘ the variance component attributable 


to terminal conditioning rate, upon 
which the groups were matched. The 
between-group to within-group vari- 
ance ratio for Groups E and C, in 
total lever extinction output is 
F = 1.98, which, for 1 and 35 df, 
gives P = .20 that this great a mean 
difference could have occurred by 
chance. For the total lever ex- 
tinction output of Groups C, and Cy, 
F = 2.46, for which P = .14 (df = 1 
and 29). The corresponding chance 
probabilities of the differences in the 
first two days of lever extinction are 
somewhat higher (P = .32 and P 
= .30, respectively). The data thus 
fall short of conventional statistical 
significance and do not warrant the 
conclusion that the theoretical popu- 
lations, of which E, C,, and Cy, are 
samples, differ in lever extinction 
output. However, neither do these 
data support the conclusion that there 
are no genuine group differences, for 
failure to reject the null-hypothesis 
is not in itself grounds for its ac- 
ceptance, and the observed differ- 
ences are too large to be completely 
ignored. Under the null-hypothesis, 
a difference between C, and E as 
large as the one observed and in this 
direction, as predicted, would be 
expected only once in ten, which, in 
the absence of further evidence, is a 
precarious statistic upon which to 
rest the conclusion that Groups E 
and C, did, in fact, show equal 
parametric response tendencies during 
lever extinction. Of much less am- 
biguous interpretation are the differ- 
ences during magazine extinction 
(Fig. 4), as revealed by analysis of 
the first day’s outputs. Covariance 
analysis yields a between-group to 
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Fic. 4. Cumulative mean lever responses 
during “magazine extinction” in Exp. II. 


within-group variance ratio of F 
= 5.65 for Groups C, and Cy, for 
which P= 02 (df =1 and 29), 
while a x? comparison of scores 
dichotomized at the common median 
reveals the inferiority of E to be 
highly significant (P < .0O1). 


Before these results can be evaluated, 
it is necessary to judge the extent to 
which comparison of lever extinction 
outputs is here a fair comparison of 


response tendencies. As in Exp. I, the 
only external stimuli present during lever 
extinction with contingency histories 
differing among the groups were the 
general environmental stimuli. In con- 
trast to Exp. I, however, there is sub- 
stantial empirical evidence from which 
here to anticipate differences in lever 
extinction outputs due to environmental 
stimuli alone, irrespective of any addi- 
tional secondary extinction effects. It 
is a solidly established fact (e.g., 7) that 
stimuli associated with aversive stimuli 
in the Skinner box subsequently suppress 
lever pressing. (A plausible explanation 
is that such stimuli, having become 
secondarily aversive through type-S con- 
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ditioning, now elicit “anxiety” responses 
—crouching, defecating, etc.—-which 
compete with other behavior.) This 
general suppressant effect may clearly 
be seen in the present data by comparison 
of the lever extinction outputs of the 
control groups in Exps. I and II. 

Merely giving experimental and con- 
trol Ss an equivalent amount of shock, 
however, is insufficient to equate the 
behavioral effects of environmental 
stimuli, since experimental Ss also re- 
ceived magazine stimuli as a signal for 
shock. It is to be expected both on 
theoretical (6) and experimental (8) 
grounds that a warning signal prior to 
an aversive stimulus should decrease 
aversive conditioning of the environ- 
mental stimuli, since the latter then 
form only a part of the total secondary 
aversive compound. This effect may be 
called, in a purely descriptive sense, the 
“buffer” effect. For present purposes, 
the buffer effect implies that magazine 
stimuli preceding shock should decrease 
the subsequent suppression of lever 
responding by environmental stimuli. 
For this reason, two control groups were 
run, one (C;) receiving a warning tone 
before shock, and the other (C,) re- 
ceiving none. It was felt that Group 
C, should be most comparable to Group 
E for the behavioral influences of en- 
vironmental stimuli, while Group C, 
would check the importance of the 
warning signal. While the difference 
between C,; and Cy, in Fig. 3 does not 
quite attain statistical significance, Fig. 
4 reveals that with high likelihood, a 
warning signal before shock does tend 
to lessen the aversive conditioning of 
environmental stimuli. 

The possibility must also be 
sidered, however, that tone might have 
been more effective than 
stimuli as a buffer, and hence have 
produced a lever extinction superiority 
of Group C, over Group E apart from 
any secondary extinction effects. There 
are, unfortunately, no experimental data 
on the determinants of buffer efficiency. 
For present purposes, an attempt was 
made to relative “anxiety” 


con.- 


magazine 


evaluate 
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during lever extinction by consideration 
of defecation rates. ‘The somewhat in- 
volved analysis (11) points to an “anx- 
iety” ordering of C; < E < Cy, with 
Group C, the most disturbed as pre- 
dicted, but with Group E as close or 
closer to Group C, than to Group C,. 
Presuming an approximately linear re- 
lationship between degree of “‘anxiety” 
and response suppression over the range 
concerned, this finding suggests that at 
least half the observed difference be- 
tween Groups C; and E, already of 
questionable significance, may be due 
to emotional differences. It must there- 
fore be concluded that, although the 
possibility cannot entirely be ruled out 
that pairing magazine stimuli with 
shock might have produced a slight loss 
of lever-pressing strength, the effect is 
scarcely detectable, at best of only 
minor importance in the composition of 
the habit. 5 

Failure to detect any appreciable 
effect, moreover, can scarcely be at- 
tributed to inadequate reconditioning of 
magazine stimuli. The differences be- 


tween Group E and Group C, or Group 
C, during magazine extinction are even 


greater than shown in Fig. 4. The 
distribution of Group E was markedly 
skewed, with but three Ss (14%) ac- 
counting for 42% of the group’s total 
output on the first day. Moreover, a 
typical § in Group E showed a positively 
accelerated daily cumulative magazine 
extinction curve, clearly indicating ex- 
tinction of the secondary aversive prop- 
erties of the magazine stimuli and 
subsequent reinstatement of their yet 
unextinguished positive reinforcement 
value. In contrast, Ss of both control 
groups gave conventional negatively 
accelerated extinction curves. Ob- 
viously, then, pairings with shock were 
highly successful in endowing magazine 
stimuli with aversive properties. 
Experiments I and II are in complete 
agreement that under at least some 
parameters of learning, an instrumental 
response, once conditioned, is little if at 
all affected by subsequent modifications 
in value of the reinforcers by which it 
was established. The present data do 
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not, of course, disprove the existence of 
secondary extinction. They do, how- 
ever, indicate the phenomenon to be less 
evident than might be anticipated on 
theoretical grounds. That secondary 
extinction does, in fact, exist outside the 
laboratory would seem scarcely to be 
doubted in view of the flexibility of 
behavior shown by at least the higher 
primates. But if it is the case that an 
effect so potentially important in the 
behavioral economy of the organism is 
this dependent upon the particular 
parameters of the learning situation, 
then investigation of the identity and 
functioning of such parameters becomes 
of considerable importance to experi- 
mental behavioristics. And if further 
research still fails to reveal secondary 
extinction, it may be necessary to revise 
radically some of our expectancies about 
the constitution of behavior. 


SUMMARY 


An attempt was made to discover secondary 
extinction in thirst-motivated lever pressing of 
the albino rat. A water-reinforced lever habit 
was established in thirsty rats, with stimuli 
accompanying water delivery (magazine stimuli) 
mediating reinforcement of the habit. The 
effect on the lever habit of modification in the 
reinforcement value of magazine stimuli was 
then determined. Two experiments are re- 
ported, one in which simple extinction of maga- 
zine stimuli as positive reinforcers was at- 
tempted, and another in _ which 
stimuli were also aversively conditioned. 
Neither experiment resulted in any apparent 
degree of secondary extinction. 


magazine 
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ERROR REINFORCEMENT IN A MODIFIED SERIAL 
PERCEPTUAL-MOTOR TASK ! 


MELVIN H. MARX AND ROBERT A. GOLDBECK ? 


University of Missouri 


A recent study by Marx (2) re- 
ported evidence supporting a rein- 
forcement explanation of the spread- 
of-effect phenomenon. A __ serial 
perceptual-motor task was used in an 
attempt to eliminate the usual re- 
sponse-bias tendencies characteristic 
of the number-guessing experiments. 
This task also had the advantage of 
permitting a more sensitive measure 
of response strength in that measure- 
ment of error gradients was not 
dependent upon exact repetition of 
errors (cf. 1). 

The purpose of the present study 
was to confirm the critical role of 
reinforcement in the production of 
after-error response strength, using 
a modified version of the task earlier 


used by Marx. ‘The modification was 


intended to eliminate response biases 


associated with the central 


position of the apparatus. 


rest 


Mernop 
Subjects 


The Ss were 379 male volunteers from the 
freshman and sophomore Air ROTC classes at 
the University of Missouri. 
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tion of the Air ROTC unit at the University of 
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This research was supported in part by the 
United States Air Force under Contract No. 
AF 33(038)-25631, monitored by the Perceptual 
and Motor Skills Research Laboratory, Human 
Resources Research Center. Permission is 
granted for reproduction, translation, publica- 
tion, use, and disposition in whole and in part, 
by or for the United States Government. 

*Now at American Institute for Research, 
Pittsburgh, Pennsylvania. 
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Apparatus 


The apparatus was a modified Mashburn 
Complex Coordinator (5). A wood panel with 
a white paper covering was mounted on the 
front of the regular Mashburn panel. This 
special panel contained two rows of 13 lights 
which showed through the paper covering. ‘The 
upper row of lights was seen through a green 
filter. Each vertical pair of lights was framed 
by wood strips, forming 13 boxes. The S could 
light any one of the 13 white lights by the left- 
right movement of the Mashburn control stick. 
The S could turn on the green light in the box 
above any selected white light by pushing the 
Mashburn rudder bar to the extreme right 
position. By means of a switch hidden from S, 
E could either allow the green light to come on 
or a buzzer to sound when S pushed the rudder 
bar. 

There was no spring tension on the stick for 
left-right movement; the usual central resting 
position was thus eliminated. 


Procedure 


The task was represented to S as a position 
memory test; he was told that the object of the 
test was to remember correct responses which 
could be discovered only by chance. The E 
called out the numbers 1 through 15, which 
served as the stimulus items. When E called 
out a stimulus number, S moved the stick to 
light one of the white lights. 

Phase 1.—The S moved the stick from the 
extreme right position to make each response. 
Holding the stick in position, S then pushed the 
rudder bar to determine whether the light was 
the correct one for that stimulus number. A 
correct response was signaled by the flashing of 
the green light; the sounding of the buzzer 
designated an incorrect response. The E called 
out the next stimulus number approximately 1 
sec. after S returned the stick to its position at 
the right. 

A trial consisted of the responses to the 15 
numbers. Trials were separated by £’s state- 
ment that the 15 numbers would be gone through 
again and that any correct responses were to be 
repeated, 

The Ss were discouraged from using any 
system in choosing lights. As an aid for re- 
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membering light positions without counting 
their distance from the ends, paper strips were 
placed beneath the four lights on each end of the 
row. 

The experimental and control groups each 
consisted of 150 Ss who provided usable data 
for this phase. The Ss in the control group 
were given two trials without reward; each 
response was followed by the buzzer. If none 
of the responses to numbers 1 through 12 was 
repeated from Trial | to Trial 2, a third trial 
was given. If S produced more than one 
repetition, the one occurring earliest in the list 
of numbers was used. A total of 22 Ss failed 
to repeat a single response under these conditions 
and so could not be used in this phase of the 
experiment. 

For each control S, a paired S in the experi- 
mental group was given a reward following his 
response to the same stimulus number. In 
order to make the groups as comparable as 
possible only Ss who repeated this rewarded 
response were used as experimental Ss in this 
phase of the experiment. A total of 57 Ss had 
to be replaced for failure to repeat the correct 
response before the necessary 150 experimental 
Ss were obtained. 

Phase 2.—Following the trials given in Phase 
1, all Ss were given two additional trials. For 
these trials, Ss were instructed to move directly 
from one light to the next without moving the 
stick back to the right after each response, as 
they had done on the earlier trials. 

Under these conditions, 61 Ss from the control 
group in Phase 1, who had received no reward, 
were rewarded for the response to Stimulus 14 
or 15 on the two trials of Phase 2. The reward 


was given to maintain motivation and morale. 


This procedure allowed for the occurrence of 
chance repetitions within the first 10 stimulus 
numbers which would not be influenced by the 
aftereffects of reward and so could be used as 
controls. 

All other Ss were given two rewards on each 
trial. The Ss from the experimental group of 
Phase 1 were rewarded for the response to the 
same stimulus number for which reward had 
earlier been given. This was done to insure 
continuity of reward for Ss who repeated previ- 
The 57 Ss eliminated 
from the Phase | analysis because of failure to 
repeat the rewarded response were treated in the 
The 
Ss from the control group of Phase 1 were re- 


ously rewarded responses. 


same manner as the other experimental Ss. 


warded for the response to the stimulus number 
for which the critical chance repetition had 
earlier This done to equate 
reinforcement conditions with the Ss from the 
Phase 1 experimental group. In all cases the 


occurred, was 
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second reward was given for a response to a 
stimulus number far enough removed from the 
first reward to be free of its aftereffect. 


ResuLts 


The basic measure of response 
strength used to evaluate’ error 
strength was the difference, in terms 
of light units, between the responses 
made immediately following the re- 
warded response and its repetition on 
the next trial. This difference score 
for the first after-error ranged between 
0 and 12, with a low difference score 
indicative of high response strength. 


Phase 1 


The mean difference score for the 
first after-error was 2.79 for the ex- 
perimental group, and 3.53 for the 
control group. A matched-pairs ¢ 
test showed this difference in favor 
of the experimental group to be sig- 
nificant (t = 2.42, P < .02 for 148 
df). This finding indicates that re- 
ward increased the tendency to repeat 
the following response over and above 
any increment associated with pre- 
experimental response sequences. 

It was considered possible that the 
role of reward was to produce some 
special response bias on both trials. 
The correlation between first- and 
second-trial responses was therefore 
obtained for the first after-error. 
Because a group response bias would 
not produce intertrial ordering of 
scores, the correlation coefficient 
should not be sensitive to the re- 


stricted range of responses produced 


by such a response bias. The ob- 
tained r’s for the experimental and 
control groups respectively were .113 
and — .024, with .159 needed for 
significance at the .05 level. The t 
for the difference between these two 
correlations was 1.17, with 1.96 
needed at the .05 level. Failure to 
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obtain evidence of a significant cor- 
relation for the experimental group 
leaves open the possibility that the 
tendency of this group to repeat the 
first after-error response might be 
attributed to a response bias produced 
by reward. 


Phase 2 


In order to use only completely 
new data, difference scores were 
computed only from the 164 experi- 
mental Ss who repeated the rewarded 
response to stimulus numbers not 
used for reinforcement in Phase 1. 

For the 61 control Ss, receiving 
only one reward at the end of the list, 
data following the first repeated 
response were used, but only if 
repetition occurred at least four 
stimulus steps before the rewarded 
response. The 43 Ss meeting this 
criterion constituted the control 
group. 

The mean difference score for the 
first after-error was 2.54 for the 
experimental group, 4.21 for the 
control group. This difference was 
clearly significant (F = 13.75, P < .01 
for 1 and 205 df). 

To determine if the low difference 
score for the experimental group 
could be attributed to a response bias 
associated with reward, the cor- 
relation between intertrial first after- 
error responses was obtained. An r 
of .291 (P < .01) lends support to an 
individual-strengthening interpreta- 
tion of the data. The obtained r for 
the control group was — .235, short 
of significance (.304 needed at .05 
level). The ¢ for the difference be- 
tween these two 7's was 3.05 
(P < 01). 

To rule out the possibility that the 
correlation between responses fol- 
lowing reward was due to some cor- 
relation between the rewarded re- 
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sponse and the response following it 
on both trials, these correlations were 
partialled out of the after-error cor- 
relation for the experimental group. 
The partial r of .276 obtained was 
still significant at the .01 level. 


Discussion 


The present results, particularly those 
in Phase 2, offer further support for the 
reinforcement interpretation of the error- 
reinforcement, or spread-of-effect, data. 
As in previous research reporting positive 
results (2, 3, 4), the use of repeated non- 
rewarded responses as controls and 
application of the _partial-correlation 
technique have served to eliminate the 
effect of intratrial influences such as are 
emphasized in the usual guessing-se- 
quence type of artifactual explanation 
(cf. 1 for a critical review of this prob- 
lem). The present results may be more 
particularly offered in support of the 
modified Thorndikian interpretation (2), 
which stresses the role of the repeated 
rewarded response as a stimulus factor 
as well as a reinforcer for the first after- 
error. 

In the present experiment only the 
first after-error difference scores were 
analyzed. This was done on the ground 
that the first after-error offers the best 
and simplest opportunity for positive 
evidence; such evidence, if adequately 
controlled, is all that is needed to estab- 
lish the reinforcement interpretation. 
The more complex question of the com- 
plete gradient effect, with which Thorn- 
dike was originally concerned, can better 
be handled later. 

The implications of the more con- 
sistent statistical evidence found in 
Phase 2 are not clear. The sequential 
resting-position procedure and the use 
of later trials were confounded. It is 
not possible to say on the basis of the 
present data which, if either, of these 
factors was responsible for the more 
clear-cut positive results. 

The fact that fewer control than 
experimental Ss were eliminated from 
the analysis in each phase reflects the 
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necessary difference in treatment (con- 
trol Ss merely required to have a chance 
intertrial repetition, experimental Ss 
required to repeat a correct response). 
It was necessary to have the critical 
response repeated in the experimental 
group so that the response-bias factor 
would be present in each case and any 
difference could then be attributed to 
the addition of the reinforcement factor. 
If there was any selection bias introduced 
from differential sampling it would 
appear to have been against the rein- 
forcement hypothesis. Elimination of 
experimental Ss who failed to repeat a 
correct response would tend to eliminate 
the less strongly motivated and less 
capable retainers. The remaining Ss, 
as better retainers, would also be 
expected to recall and vary more of the 
errors made and therefore be less likely 
to show the spread effect. It is, of 


course, impossible to say on the basis 
of the present data to what extent a 
selection bias was operating, but the 
important point is that if present it 
would seem to work against, rather than 
for, the hypothesis. 


SUMMARY 


An experiment was designed to control for 
response biases produced by the central rest 
position of an apparatus used in an earlier study 
which had given positive spread-of-effect results. 
Two methods of correction were used: (a) the 
extreme right position was used as the rest 
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position; (b) the previous response position 
‘served as the rest position. 


The task was 
presented to 379 Ss as a position memory test. 
Typical prearranged spread-of-effect reinforce- 
ment conditions were employed. Response-bias 
factors were controlled by the use of error scores 
following repeated nonrewarded responses and 
by the application of the partial-correlation 
technique. 

Errors made immediately following repeated 
rewarded responses were significantly stronger 
than errors made immediately following repeated 
nonrewarded responses. These results were 
seen as offering further support for the rein- 
forcement interpretation of spread-of-effect data. 
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CONDITIONING OF MUSCLE ACTION POTENTIAL 
RESPONSES RESULTING FROM PASSIVE 
HAND MOVEMENT! 


DONALD G. DOEHRING? 


Indiana University 


Passive movement can be defined 
as displacement of part of an organism 
produced by the action of an agent 
other than the organism itself. As 
commonly used, the term seems to 
imply that the organism does not 
actively participate in the movement. 

The conditioning of passive move- 
ment has been attempted in two 
experiments. Miller and Konorski 
(6) found that when a stimulus was 
paired with the passive lifting of a 
dog’s foot, and this combination was 
followed by the presentation of food, 
the stimulus came to serve as a CS 
for an active flexion response. Wood- 


bury (9) used a similar procedure, 
modified to prevent the possibility of 


conditioned avoidance movement or 
postural adjustment, and failed to 
obtain a CR. 

Conditioning should take place if a 
stimulus is paired with a response 
and the response is followed by a 
reinforcing state of affairs, according 
to most learning theories. From 
Woodbury ’s results, then, it might be 
concluded that passive movement 
does not involve response by an 
organism. Passive movement would 
thus be qualitatively different from 
reflex or voluntary movement. 

If passive movement does involve 
organismic responses, however, these 


'This article is based upon a dissertation 
submitted to the department of psychology at 
Indiana University in partial fulfillment of the 
requirements for the Ph.D. degree. ‘Thanks are 
due to Dr. R. C. Davis who directed the dis 
sertation. 

? Now employed by Central Institute for the 
Deaf, St. Louis, Mo. 
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responses would probably be rela- 
tively small. Any CR related to 
these responses would probably be 
even smaller. The measure of re- 
sponse (treadle movement) used by 
Woodbury may not have been sensi- 
tive enough to detect such small 
responses. The recording of action 
potentials from muscles directly. af- 
fected by passive movement should 
provide a more sensitive measure in 
an investigation of the conditioning 
of passive movement. 

In the present experiment passive 
hand movement was paired with a 
CS, and muscle action potentials were 
recorded from muscles most likely to 
be involved in the passive movement. 
The problems to be investigated were: 
(a) Do action potential responses 
occur in muscles directly affected by 
passive hand movements? (b) Can 
conditioned muscle action potential 
responses be developed in situations 
where the UCS is a passive hand 
movement? 


Metuop 


Experimental design.—The conditioning pro- 
cedure was patterned after the Pavlovian 
paradigm. The CS was a weak 2-sec. white 
for all groups, whose termination was 
immediately followed by the UCS on condi- 
tioning trials. ‘Two experimental groups of 10 
Ss each were used, with one group (XA) re- 
ceiving passive extension of the right hand and 
the other group (XB) receiving passive flexion 
of the right hand as the UCS. Corresponding 
control groups (CA and CB) of 10 Ss each 
received auditory stimuli attendant upon the 
production of passive movement without re- 
ceiving the actual movement. Trials 1, 8, 12, 
15, and 21 through 23 were test trials, where the 
CS was presented alone. Trials 2-7, 9-11, 13, 


noise 
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14, and 16-20 were conditioning trials, where 
the CS was paired with the UCS. 

Subjects—The Ss were 40 students from 
introductory courses in psychology at Indiana 
University, none of whom had served in pre 
vious muscle action potential experiments. 

A pparatus.—The equipment and techniques 
used for the measurement and recording of 
muscle action potentials have been previously 
described by Davis (1, 2, 3). Briefly, muscle 
action potentials are picked up by recording 
electrodes, amplified, integrated over .l-sec. 
intervals, further amplified, and recorded by 
mechanical recording pens. The height of each 
recorded peak represents the total microvolts of 
potential within a given .l-sec. interval. One 
change made in the usual recording procedure 
was the use of bipolar electrodes in place of 
monopolar electrodes in order to obtain a more 
precise measure of the 
extensor and 


difference between 
flexor levels of response. Re- 
cordings from each muscle were made by the 
use of a pair of }-in. brass electrodes mounted 
4 in. apart on a piece of Plexiglas. Since the 
distribution of the logarithms of microvolts of 
action potential has been found to be more 
nearly normal than that of unconverted meas- 
ures (5), all action potentials were measured on 
a 100-log microvolt scale. 

Apparatus for producing passive movement 
included a 21-in. X 4-in. strip of wood covered 
with sponge rubber, on which S rested his right 
forearm. ‘The strip was fastened to the top of 
a table in the experimental room. The front 
section of the strip was movable, being hinged 
at a point directly under S’s wrist. Two coil 
springs were fastened to the front of the movable 
section, each exerting a force of 29 oz. when the 
section was in a horizontal position. The 
movable section was held down by a metal rod 
which was connected at the other end to a pair 
of solenoid plungers, the coils of which were 
mounted on the outside of the experimental 
room. When the solenoids were activated, the 
bar was withdrawn from a roller bearing 
mounted on the underside of the movable 
section, allowing it to be raised by the springs 
to an angle of about 30° from the horizontal. 
The movable section could then be returned to 
a horizontal position by pulling a string from 
the outside of the experimental room. Rein- 
sertion of the bar maintained this position until 
the solenoids were again activated. 

The CS was a 37 db white noise produced by 
a white noise generator and delivered through 
a loudspeaker mounted in a wall of the experi- 
mental room. ‘The white noise could be heard 
easily by all Ss, but was not loud enough to 
produce a large muscle potential response by 
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itself. The duration and temporal contiguity 
of the white noise and solenoid activation were 
controlled by an electronic timer (8). A con- 
tinuous 35 db low-frequency noise was provided 
by the hum of a loudspeaker located directly 
below the speaker which delivered the white 
noise. This served to mask extraneous sounds 
from outside the experimental room. 

Procedure.—The Ss were divided randomly 
into four groups of 10 each. Their order of 
running was randomized. 

Muscle potentials were recorded from two 
muscles on the right forearm, the extensor 
digitorium and flexor digitorum. The muscles 
were located by palpation, and a pair of elec 
trodes was over the crest of each 
muscle after electrode paste had been applied 


fastened 


The S was seated in a darkened, sound 
insulated experimental room with his right arm 
loosely fastened to the sponge rubber covered 
strip of wood in such a way that his wrist was 
directly over the hinge of the movable section. 
For Groups XA and CA the palm of the hand 
was down, and for Groups XB and CB the palm 
wasup. The S was told that the purpose of the 
experiment was to investigate the effects of 
several types of distraction and was asked to 
remain as relaxed as possible. ‘The first trial was 
given about 10 min. after S had been placed in 
the experimental room, and the intertrial interval 
was varied unsystematically between 40 and 0 
sec. 

The CS was presented to all groups on all 
trials. On conditioning trials the solenoids were 
activated for .2 sec. immediately following the 
cessation of the CS. For Groups XA and XB 
this resulted in a gradual upward movement of 
the right hand. The remained in this 
position for about 6 sec. and was then lowered. 
For ¢ sroups CA and CB the movable section was 
tied down and these groups consequently re 
ceived only the noise produced by 
activation as the UCS. 


hand 


solenoid 
Measurement of records.—The largest of the 
five integrated action potentials occurring during 
the .5-sec. interval immediately preceding the 
CS on each trial was used as the prestimulus 
level of response for that trial. The highest 
peak in each of the four .5-sec. intervals occurring 
during the CS was measured on all trials. Since 
solenoid activation produced a recording arti 
fact, the unconditioned response (UCR) was 
determined by measuring the 
occurring in each of the two intervals 
following the break of the solenoids. On test 
trials the CR was determined by measuring the 
highest peak in each of the first three .5-sec 
intervals following the termination of the CS 


highest peak 


5 sec 


In all cases, responses during a given trial are 
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expressed as increments from the prestimulus 
level on that trial. 


RESULTS 


In the interest of brevity, results 
will be given only in terms of changes 
in response within pooled trials. 
Figure 1 shows increments in response 
of extensor and flexor muscles for 
Groups XA and CA, while Fig. 2 
shows corresponding increments for 
Groups XB and CB. Measures dur- 
ing the CS interval represent a pooling 
of CS interval responses on Trials 3 
through 23. CR interval measures 
represent a pooling of CR interval 
responses on all test trials except 
Trial 1. UCR interval measures 


represent UCR interval measures on 
all conditioning trials. 

From Fig. 1 and 2 it can be seen 
that for Groups XA and XB the level 
of tension of the extensor and flexor 
muscles steadily increased during the 
presentation of the CS. A definite 


UCR was produced in both muscles, 

which decayed rapidly. On test trials 

there was a slight increase in tension 
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Fic. 1. Response increments within trials 
for Groups XA and CA. Each point represents 
a mean value for all Ss in a given group. 
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Fic. 2. Response increments within trials 
for Groups XB and CB. Each point represents 
a mean value for all Ss in a given group. 


of both muscles during the CR 
interval, as compared with the CS 
interval. For both experimental con- 
ditions (extension movement with the 
palm down and flexion movement 
with the palm up) there was no 
consistent difference between extensor 
and flexor levels of activity. 

The level of tension for both 
muscles in Groups CA and CB re- 
mained at about the prestimulus 
level during the CS and CR intervals. 
During the UCR interval, however, 
there was a slight increase in tension, 
indicating a response to the sound 
produced by solenoid activation. 

The above results show graphical 
evidence of greater increments of 
tension for the experimental groups 
as compared to their respective con- 
trol groups during all trial intervals. 
For a statistical test of these differ- 
ences the data represented by the 
mean values shown in Fig. 1 and 2 
were used, except that all measures 
within a given interval were pooled 
into a single measure for that interval. 
Since the data were found not to meet 
the requirements for a parametric 
test, a nonparametric test—the Wil- 
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TABLE 1 


Mean Dirrerences Between Experimenrar 
anp Controt Group INCREMENTS OF 
Tension Expressep in 100 
Loc Microvo.ts 


Condition 


B (Palm Up) 


Extensor Flexor 


1.31 
3.24 


Interval A (Palm Down) 


Extensor Flexor 


“3.31% 
517° 


1.60*° 
4.33* 
9.22** 


5.39° 





18.82°* | 8.88°* 


coxon T test (7)—was used. Table 
1 shows the mean difference between 
experimental and control groups for 
each muscle in each interval. It can 
be seen that in each interval at least 
one of the two muscles of the experi- 
mental groups manifested  signifi- 
cantly more tension than their re- 
spective control group measures. 


Discussion 


The experimental results give a clear- 
cut answer to the two major questions 
raised. The significantly larger muscle 
potentials occurring for the experimental 
groups, as compared with their respective 
control groups, during passive hand 
movement (UCR interval) demonstrate 
that action potential responses do occur 
in muscles directly affected by passive 
hand movement. This indicates that 
“passive movements” are not truly 
passive, since they result in activation of 
muscles in the organism. 


The significantly larger responses oc- 
curring for the experimental groups 
during the CS and CR intervals demon- 
strate that conditioned muscle action 
potential responses can be developed in 
situations where the UCS is a passive 
movement. A CR occurred following 
the CS on test trials, and a conditioned 
anticipatory response occurred during 
the presentation of the CS on both 
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conditioning and test trials. The occur- 
rence of an increment in response during 
these intervals does not necessarily 
imply that the CR duplicates the UCR. 
It is possible, for example, that the 
tension increment is part of a general 
increase in tension of the organism 
preparatory to the UCS. Recording of 
action potentials from other parts of the 
body would be necessary to determine 
this. It is obvious, however, that con- 
ditioning of at least the extensor and 
flexor responses has taken place. 

An interesting consequence of the 
successful conditioning produced in the 
present experiment is the fact that 
conditioning has taken place in the 
apparent absence of overt reinforcement. 
This seems to provide support for con- 
tiguity learning theories as opposed to 
reinforcement learning theories. A pos- 
sible argument against this explanation 
is that the passive movement produced 
a “startle reflex.” This would con- 
stitute the UCR, and therefore any 
conditioning taking place would be 
conditioning of a startle response rather 
than conditioning of a passive movement. 
In measuring forearm extensor and flexor 
action potentials occurring during a 
startle response, however, Davis (4) 
found latency and decay times of po- 
tentials accompanying the startle re- 
sponse which were much longer than 
those of the UCR produced by passive 
movement. It therefore appears that 
although passive hand movement can be 
successfully used as a UCS, the response 
to passive movement does not involve 
reinforcement in an obvious manner. 


SUMMARY 


The experiment was designed as an investi- 
gation of passive movement. By recording 
action potentials from muscles most likely to 
be involved in passive hand movement, in- 
formation was obtained as to (a) the nature of 
muscular responses involved in passive hand 
movement, and (b) the conditioning of muscle 
action potentials resulting from these move- 
ments. Action potentials were recorded from 
the extensor digitorium and flexor digitorium 
muscles of the right forearm. In the condi- 
tioning procedure a white noise was paired with 
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passive hand movement for a number of con- 
ditioning triels. Interspersed among these were 
test trials during which the white noise was 
presented alone. One group of 10 Ss received 
a passive extension and another group of 10 Ss 
received a passive flexion of the right hand. 
For each experimental condition there was a 
control group of 10 Ss which received identical 
treatment except that no passive movement was 
given. 

It was found that muscle action potential 
responses are produced by passive movement 
and that conditioned muscle action potential 
responses occur when passive movement is used 
asthe UCS,. The conditioning appeared to take 
place with no obvious reinforcement of the 
response. 
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PROBLEM SOLVING IN MULTIPLE-GOAL SITUATIONS! 


SCARVIA B. ANDERSON 


Naval Research Laboratory * 


This experiment was designed to 
study the relationship between num- 
ber of plausible goals with which Ss 
are faced and performance on logical 
problems. It was hypothesized that 
with more than one goal, competing 
responses oriented toward different 
goals would tend to interfere with one 
another, and as number of goals 
increased amount of interference 
would «increase. Viewing problem 
solving as a sequential series of 
behaviors (not as an “all-or-none”’ 
affair), the aim of the experiment was 
to specify what form interference 
might take in this series of behaviors, 
where it would be most likely to 
occur, and whether it would be serious 
enough to impede solution. 


MetTHop 


Subjects.—The Ss were 48 Navy enlisted men; 
all had General Classification Test scores of 60 
(1 SD above the population mean of 50) or 
above. They were similar in age and number of 
years of schooling. 

Material.—The calculus of propositions was 
used as the source of problems (5, 6). Hach 
problem consisted of a series of premises from 
which a specified conclusion was to be drawn. 
Problems were presented in their abstract form, 
and no “meanings” were ascribed to the symbols 
used in them. Each S was trained in the use of 
12 transformation rules which he could apply to 


! This article is taken from a thesis submitted 
in partial fulfillment of the requirements for the 
degree of Doctor of Philosophy, University of 
Maryland, 1955. The author is indebted to 
Dr. C. N. Cofer for serving as advisor on the 
thesis and to Dr. T. G. Andrews and Dr. R. C. 
Hackman, University of Maryland, for sug- 
gestions and criticisms; to Dr. A. H. Ryan, 
formerly of the Naval Research Laboratory, for 
support which made the study possible; and to 
Dr. O. K. Moore, Yale University, for criticisms, 
especially with respect to points of logic. 

2 Now at the Educational Testing Service. 


the premises and the expressions derived from 
them in an attempt to reach the conclusion. 

Two sets of problems were used: 10 practice 
problems (varying widely in difficulty and 
demanding application of all 12 of the trans- 
formation rules) and four experimental problems 
(of approximately equivalent difficulty), Each 
of the latter contained three premises and 
required a minimum of six steps for solution. 
The problem shown above the horizontal line 
in Table 1 is one experimental problem, as it 
was presented to Ss under the one-goal condi- 
tion. Under the other three experimental 
conditions, the information in the upper right- 
hand sector was changed as follows: 


QorR 
P,Q, or R 
P,Q, R, or S 


2-goal condition 
3-goal condition 
4-goal condition 


Under the multiple-goal conditions, only one 
goal was actually deducible. However, Ss 
assumed all goals were deducible, and only three 
questioned whether it was possible to obtain 
even one of the goals about which they were 
asked. 

Practice sessions.—Following training in the 
use of the rules of the system (see 5), each S 
participated in seven practice sessions lasting a 
maximum of 30 min. each. During practice 
Sessions 1-3, two 15-min. problems were pre- 
sented; during Sessions 4-7, one 30-min. prob- 
lem was used. If S did not solve a problem 


TABLE 1 
ExampLe Prosiem 


(i) (R-S)vP 

(2) (PvS) D> (QvR) 

(3) ~Q 

*(4) Pv(R-S) 

*(5) (PvR)- (PvS) 
OvR 





* Steps in minimum set (called ‘relevant’ ). 
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within the time allowed, E demonstrated a 
solution. The problem-solving apparatus was 
identical to that used during experimental 
sessions. 

Experimental sessions.—A randomized block 
design was used with two replications; each S 
received each of the four problems and each of 
the four goal conditions, but the combinations 
and orders were different for each S within the 
same group of 16. Sessions lasted a maximum 
of 30 min. each on four successive working days. 
If S solved a problem before 30 min. had elapsed, 
he was allowed to leave. 

The S sat in a semidarkened room facing a 
projection screen. The E sat on the other side 
of a room divider by an overhead projector; he 
was not visible to S. At the beginning of a 
session, E turned on the projector displaying the 
set of premises and goal or goals appropriate to 
that session (¢.g., the part above the horizontal 
line in Table 1). If more than one goal was 
shown, E said, “You will notice that there are 
two (three, four) letters in Sector II [the upper 
right-hand sector]. We will consider that you 
have solved the problem when you get one of 
those.” ‘The S called out to E suggestions for 
steps in terms of the rule he wished to apply and 
the line(s) to which he wished to apply it; e.g., 
“Apply Rule 11 to Lines 13 and 3” (the order 
which would yield Line 14, Table 1). If the 
rule was applicable to the line(s) specified, E 
applied it by writing the result and the “justi- 
fication” on the face of the overhead projector; 
e.g., for Line 14, 'Table 1, he wrote 

(14) R (13,3) 11. 
If the rule was not applicable to the line or lines 
specified, E sounded a buzzer, indicating an 
error, and, of course, no new step was generated. 
The S was allowed to keep a rule sheet (attached 
to an edge-lighted clip-board) before him during 
all sessions, in order that his task not be pri- 
marily one of memorizing rules. During runs 
with the last 32 Ss, a graphic ammeter was used 
to record times for errors and correct steps. 

Problem-soloing responses.—The kinds of 
problem-solving responses in which E was 
interested can be illustrated with reference to 
Table 1. ‘The hypothetical S working on the 
problem took all six relevant steps in the 
minimum set required for solution (Lines 4, 5, 
10, 11, 13, and 14) and five irrelevant steps 
(Lines 6, 7, 8,9, and 12); thus he took 11 correct 
steps. ‘Time and errors are not shown in Table 
1, but if he made one error and worked a total 
of 16 min., activity level would be (11 4+ 1)/16 
or .75. His first relevant step was followed by 
another relevant step, this second relevant step 
was followed by an irrelevant step, his third 
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step was followed by an irrelevant step, 
etc. He introduced unnecessary material once 
(Line 6) and used this material in five (Lines 6, 
7, 8,9, and 12) or 45% of his steps. He used all 
premises. 

He produced two redundancies (Lines 8 and 
12). Two kinds of redundancy can be dis- 
tinguished logically and the distinction is perhaps 
meaningful psychologically: cycles and repeti- 
tions. A cycle generally involves using an 
expression (e.g., Line 6) as the source of another 
expression (e.g., Line 7) which is in turn used to 
obtain the first expression again (e.g., Line 8); 
the number of mediating steps may vary. A 
repetition results if exactly the same response is 
made twice (e.g., Line 12 is identical with Line 
9 and was obtained by the same means, applying 
Rule 6 to Line 8). 


Resutts * 


Solution.—The numbers of Ss ob- 
taining solution under the 1-, 2-,3-, and 
4-goal conditions were, respectively, 
21, 17, 15, and 10. These numbers 
were subjected to a type of analysis 
of variance suggested by Hoyt (3) for 
use in estimating the reliability of a 
test. The method of partitioning 
variation and the results are shown 
in Table 2. Number of solutions is 
an inverse function of number of 
goals. 

Sequence.—Figure 1 shows the pro- 
portions of the 48 Ss who obtained 
each of the successive relevant steps 
in the minimum set leading to the one 
deducible goal in problems. (Ir- 
relevant steps are not taken into 
account in this presentation.) It 
will be noted that 77, 79, 77, and 71% 
of Ss obtained the first relevant step 
with 1, 2, 3, and 4 goals, respectively, 
while 44, 35, 31, and 21% of Ss 
obtained Step 6 under the four goal 
conditions. The magnitude of the 
slopes of the curves gradually in- 
creases as number of goals increases. 


* Tabulations and complete analyses of raw 
data and verbal reports of Ss are included in the 
thesis, available through the Library of Congress 
on microfilm (Mic 56-261), 117 pages $1.46. 
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TABLE 2 


ANALYSES OF VARIANCE 


Solution 


MS 


Linear regression 

Nonlinear 
regression 

Goal conditious 

Individuals 

Residual 

Total 

















*P < 0S. 
=P < 01, 
- P < O01. 


The numbers of Ss obtaining Rele- 
vant Steps 1, 2, 3, 4, 5, and 6 were 
analyzed by the Hoyt method. Goal 
conditions began to correlate signifi- 
cantly (P < .05) with performance 
at Step 4, where 22, 18, 17, and 13 Ss 
obtained the step with 1-4 goals, 
respectively. The numbers of Ss 
obtaining Step 5 were the same as the 
numbers of Ss obtaining Step 6 
(solution). 

The empirical probabilities of prob- 
lem solution, knowing S has been 
given the problem, has taken relevant 
Step 1, has taken Step 2, has taken 
Step 3, etc., are shown in Fig. 2. The 
first set of points (So > S¢), of course, 
corresponds to the last set of points 
(Step 6) in Fig. 1. The sequential 
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Fic.1. Proportions of Ss taking sequential rele- 
vant steps in 1-, 2-, 3-, and 4-goal situations. 
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Fic. 2. Probabilities of problem solution 
(Ss), given the problem alone (So), Step 1 (S,), 
etc. in 1-, 2-, 3-, and 4-goal situations. 


probability curve for 1 goal is a 
smooth increasing function, successive 
steps providing fairly regular incre- 
ments to probability of solution. 
With 2, 3, and 4 goals, however, there 
is a very apparent change of slope in 
the functions. Taking Steps 1 and 
2 increases somewhat the probability 
of solution, but a far greater increase 
occurs once S has taken Step 3 and 
from there on. Up to Step 5, the 
taking of which essentially guarantees 
solution for all goal conditions, the 
sequential probabilities of solution, 
knowing S’s progress to any point, 
rank in inverse relationship to the 
number of goals with which S is faced. 

Figure 3 presents the empirical 
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probabilities of sequential relevant 
steps. For Ss working with 2, 3, or 
4 goals there is an increase in the 
probability of Step 2 given Step 1 over 
the probability of Step 1 given the 
problem alone, but there is a sharp 
decrease in the probability ‘of Step 3 
given Step 2, as compared with the 
preceding and succeeding probabilities 
of sequential steps. 

The data in Fig. 4 provide some 
information on _ successive steps, 
whether relevant or irrelevant. The 
probability that S’s first step will be 
irrelevant is fairly high for all goal 
conditions (.48 to .58). The proba- 
bility that S will take an irrelevant 
step immediately following relevant 
Step | is considerably lower. For 2, 
3, and 4 goals, the probability of an 
irrelevant step following relevant Step 
2 shows a more marked increase than 
for one goal. The curves of Fig. 4 
represent essentially an inversion of 
the curves of Fig. 3; i.e., as the proba- 
bility of an irrelevant step goes up in 
Fig. 4, the probability of a relevant 
step goes down in Fig. 3. 

Curves similar in shape to those in 
Fig. 4 were obtained when the proba- 
bilities of relevant steps immediately 
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Fic. 3. Probabilities of sequential relevant 
steps (probability of taking Step 1 given the 
problem—So 23 §,, probability of taking Step 2 
given Step 1—S,; 3 Sz, etc.) in 1-, 2-, 3-, and 


4-goal situations. 
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Fic. 4. Probabilities of irrelevant steps (S;) 
immediately following the problem alone (So) 
or relevant steps (S,, Sq, etc.) in 1-, 2-, 3-, and 
4-goal situations. 


following one or more irrelevant steps 
were plotted. The difference between 
the two sets of curves is in the main 
one of ordinate distance, the proba- 
bility of a relevant step following an 
irrelevant step never exceeding the 
probability of an irrelevant step fol- 
lowing a preceding relevant step. 

Redundancy.—On 112 of the 192 
problem protocols, no redundancies 
occurred. Only 13 of the 48 Ss, 
however, failed to cycle or repeat at 
some time during their work on the 
four problems. The numbers of Ss 
who did not cycle or repeat were 33, 
31, 27, and 21 for 1-4 goals—a highly 
significant linear trend. The results 
of the Hoyt analysis of variance are 
summarized in Table 2. 

Introduction of unnecessary material 
and use of necessary material.—Two 
of the 12 transformation rules taught 
to Ss allowed them to introduce 
material into a problem, material 
which might or might not be dupli- 
cated in the premises initially given. 
In only one problem was S required 
to use one of these rules to bring 
anything into a problem. All other 
cases in which material was intro- 
duced were tabulated. In 101 prob- 
lem-solving attempts, Ss introduced 
unnecessary material; in 91 they did 
not. Thirty of the 48 Ss under the 





PROBLEM SOLVING IN MULTIPLE-GOAL SITUATIONS 


l-goal condition did not introduce 
unnecessary material. Numbers 
under the 2-, 3-, and 4-goal conditions 
were 25, 20, and 16. This linear 
trend is significant beyond the .O1 
level (see Table 2). 

If unnecessary material was once 
introduced into a problem, under all 
goal conditions, Ss on the average 
utilized it in the majority of their 
steps. The 18 Ss who introduced 
unnecessary material in the 1-goal 
condition utilized it in an average of 
59% of their steps. The 23 Ss under 
the 2-goal condition used it on the 
average in 53% of their steps. The 
mean percentages of steps utilizing 
unnecessary material for the 28 Ss 
under the 3-goal condition and 
the 32 Ss under the 4goal condi- 
tion who introduced such material 
were, respectively, 57% and 63%, 
One, two, four, and five Ss under the 
l-, 2-, 3-, and 4goal conditions 
employed unnecessary material in 
every one of the steps they took. 


Each of the experimental problems 
contained three premises, the given 


information. Under the _ l-goal 
condition, 35 out of 48 Ss used all 
premises; the corresponding numbers 
under the 2-, 3-, and 4-goal conditions 
were, respectively, 37, 36, and 28. 
This variation is, however, not sta- 
tistically significant (see Table 2). 
Efficiency and error.—The 48 Ss 
took a total of 449 correct steps under 
the l-goal condition, 469 under the 
2-goal condition, 461 under the 
3-goal condition, and 497 under 
the 4goal condition. Of these, 
293, 323, 326, and 384 were irrelevant 
under the l-, 2-, 3-, and 4-goal con- 
ditions, respectively. Each S was 
assigned an efficiency score, defined 
as the ratio of relevant steps to total 
correct steps, for each problem on 
which he worked. Many of these 
scores were zero, and it was advisable 
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to use a distribution-free test of 
significance. Assuming, as with the 
Hoyt technique, that Goal X Prob- 
lem and Goal X Trial interactions 
behaved as random variates, the 
median test (4, Formula 109) was 
employed. With 1 goal 28 out of 
the 48 scores were above the medians 
for their respective rows (correspond- 
ing to individuals). With 2 goals 
23 were above the medians, with 
3 goals 27, and with 4 goals 18. 
Chi square (df = 3) is 3.86, corre- 
sponding to a probability value be- 
tween .20 and .30. 

Examination of the 192 problem 
protocols showed that in 73 cases no 
errors (violations of transformation 
rules) were made. The numbers of 
Ss not making any errors under the 
four goal conditions were 17, 18, 24, 
and 14, respectively. As Table 2 
indicates, number of goals is not 
significantly related to failure to make 
errors. 

Activity level and timing.—Numbers 
of responses per minute of working 
time for each S for each problem, 
condition, and trial were subjected to 
eight analyses of variance (one for 
each set of 48 independent scores). 
The over-all results indicated that 
activity level does not vary signifi- 
cantly with goal conditions, problems, 
trials, or interactions between goal 
conditions and trials or problems. 

There were 20 unsolved problems 
each under the Il- and 2-goal condi- 
tions, 23 unsolved problems under 
the 3-goal condition, and 25 under 
the 4-goal condition for the 32 Ss for 
whom detailed time records were kept. 
The 30-min. working time was di- 
vided into 10 3-min. intervals, and 
the mean cumulative percentages of 
responses by the end of each interval 
for each of the four goal conditions 
were computed, There is no indi- 
cation that Ss working (and not 
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solving) under the different condi- 
tions distributed their efforts very 
much differently. In all four con- 
ditions, somewhat more than half 
(56% to 66%) of the responses were 
made by the end of the half-way mark 
in time (15 min.). 

For all Ss who solved their prob- 
lems, cumulative percentages of 
problems solved by the-end of suc- 
cessive 3-min. time intervals were 
computed. The Kolmogorov- 
Smirnov (7, p. 427) test of the 
difference beween sample cumulative 
percentage polygons yielded insig- 
nificant results when the distributions 
for the four goal conditions were 
examined. 


Discussion 


Not all Ss faced with one obtainable 
goal solved their problems. Their fail- 
ures may be attributed to general and 
specific characteristics of the individuals 
and “chance.” But fewer Ss solved 
their problems when faced with pseudo 


goals in addition to the “real” goal. It 


is assumed that similar general and 
specific characteristics and ‘‘chance” 
affected their performances, so the differ- 
ential is attributed to interference. 

At what stage in the problem-solving 
sequence does interference occur? From 
the results, it is judged that it occurs 
after the initial restructuring steps. The 
§ first rearranges one or more premises 
into a more workable form. The initial 
restructurings are probably suggested by 
the premises themselves and the forms 
of the rules and are not importantly 
dependent on the goal to be deduced. 
The majority of Ss under all conditions 
get the first two relevant steps for a 
problem. The interval between initial 
restructuring steps and steps more 
specific to the “‘real’’ goal is the crucial 
interval for Ss faced, in addition, with 
one or more pseudo goals. The proba- 
bility of an irrelevant step occurring 
increases at this point, but the corre- 
sponding probability that once S has 
taken an irrelevant step he will be able 
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to get back on the right track does not 
increase by so large an amount. If § 
can get by this crucial interval, if he can 
once again pick up the chain of steps 
leading to the obtainable goal, his 
chances of problem solution are almost as 
high as those for Ss not confronted with 
pseudo goals. 

What kinds of irrelevancies occur to 
distract Ss from the path to the ob- 
tainable goal? Introduction of unneces- 
sary material is one kind. It has been 
shown that once § has introduced un- 
necessary material, the probability that 
it will occupy a large part of his problem- 
solving effort is high; and the probability 
that unnecessary material will be intro- 
duced increases as the number of goals 
increases. The S’s behavior under these 
circumstances may be described as pre- 
occupation with his own products. He 
tends to ignore material provided by E£, 
attempting to pull a solution “‘out of the 
air.” Related to S’s use of his own 
material is a tendency for the 4-goal 
condition toward an increase in the 
probability that he will not use all 
premises provided to him by E. 

The increase in the probability of 
redundant activity as number of goals 
increases provides evidence for the 
interference hypothesis. It suggests 
that S takes a step or series of steps 
oriented toward one goal, is distracted 
by another goal, and forgets what he 
had done before or where his set of steps 
was leading. In one experiment (2) an 
increase in redundant activity for the 
experimental group did not accompany 
a significant decrement in number of 
solutions, and it was suggested that some 
psychological advantages of redundancy 
perhaps outweighed its superficial dis- 
advantages. Another study (1), how- 
ever, showed a significant though low 
negative correlation between number of 
cycles and repetitions and progress 
toward problem solution. In the present 
study, since cycles and repetitions are an 
increasing function of number of goals 
and number of solutions is a decreasing 
function, we cannot discount the possi- 
bility that redundancy may have served 
to impede solution. 
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SUMMARY 


Number of plausible goals with which S was 
faced was introduced as the independent variable 


in an experiment on human problem solving. 
Each of the 48 Navy Ss worked with 1, 2, 3, and 
4 goals in problems drawn from the calculus of 
propositions. The results, interpreted in terms 
of interference between responses oriented 


toward competing goals, indicate that (4) num- 
ber of solutions is an inverse function of number 
of goals, (b) sequential probabilities of relevant 
and irrelevant problem-solving steps vary as 
functions of number of goals, (c) number of Ss 
engaging in redundant activity and number of 
Ss introducing unnecessary material to problems 
are direct functions of number of goals, (d) effi- 
ciency scores and number of Ss using all given 
material show slight but statistically insignificant 
tendencies to vary as functions of number of 
goals, and (¢) violations of rules for working on 
problems, over-all activity level, and distri- 
bution of effort over time are not related to 
number of goals. 


. McNemar, Q. 
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